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Abstract —The compute-and-forward framework permits each 
receiver in a Gaussian network to directly decode a linear 
combination of the transmitted messages. The resulting linear 
combinations can then be employed as an end-to-end commu¬ 
nication strategy for relaying, interference alignment, and other 
applications. Recent efforts have demonstrated the advantages of 
employing unequal powers at the transmitters and decoding more 
than one linear combination at each receiver. However, neither 
of these techniques fit naturally within the original formulation 
of compute-and-forward. This paper proposes an expanded 
compute-and-forward framework that incorporates both of these 
possibilities and permits an intuitive interpretation in terms of 
signal levels. Within this framework, recent achievability and 
optimality results are unified and generalized. 

Index Terms —interference, nested lattice codes, compute-and- 
forward, successive decoding, unequal powers 

I. Introduction 

Consider a Gaussian wireless network consisting of multiple 
transmitters and receivers. In this context, the compute-and- 
forward framework of [1] enables the receivers to decode 
linear combinations of the messages, often at much higher 
rates than what would be possible for decoding the individual 
messages. This strategy can be used as a building block 
for relaying strategies [2], MIMO integer-forcing transceiver 
architectures [3]—[8], or interference alignment schemes [9]- 
[ 11 ]. 

The coding scheme underlying this compute-and-forward 
framework maps the messages, which are viewed as elements 
of a vector space over a prime-sized finite field, to nested lat¬ 
tice codewords. The receivers are then able to decode integer- 
linear combinations of the codewords with coefficients chosen 
to approximate the real-valued channel coefficients (with better 
approximations yielding higher rates). Finally, these integer- 
linear combinations of lattice codewords are mapped back to 
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the vector space over the finite field to yield linear combina¬ 
tions of the original messages. In other words, compute-and- 
forward creates a direct connection between network coding 
over a finite field and signaling over a Gaussian channel. 

We now recall two simple properties of Gaussian networks 
with multiple transmitters and receivers. First, the amount 
of available power may vary across transmitters. Second, the 
noise variance may vary across receivers. Thus, we would like 
our coding scheme to be versatile enough to allocate power 
unequally across transmitters as well as space codewords far 
enough apart to tolerate the noise at the targeted receivers. 
For classical random coding strategies that aim to deliver 
subsets of the messages to the receivers, these two forms of 
versatility can be viewed as simply the flexibility to adjust 
the targeted signal-to-noise ratio (SNR) for each codeword. 
However, in the compute-and-forward setting, the receivers 
want linear combinations of the messages, and the effect of 
noise variance and power on the nested lattice codebooks is 
more nuanced than in the classical random coding setting. 
In particular, the codeword spacing for a given message is 
determined by the maximum noise variance across all receivers 
whose desired linear combinations involve the message. This 
codeword spacing corresponds to the density of the fine lattice 
from which the codewords are drawn. Additionally, the power 
level for a given message is determined by the power constraint 
of the associated transmitter. This power level corresponds to 
the second moment of the coarse lattice used for shaping. 

The goal of this paper is to expand the compute-and-forward 
to include these two forms of versatility while retaining the 
connection between the finite field messages and the lattice 
codewords. Prior work has focused on either varying the noise 
tolerances (i.e., the codeword spacings) or the power levels 
but not both. For instance, the framework in [1] permits the 
codewords to tune their noise tolerances but requires that 
the codewords have the same power level. In [12], Nam et 
al. proposed a nested lattice technique that permits unequal 
power levels for multiple transmitters that communicate the 
sum of their codewords to a single receiver over a symmetric 
channel (i.e., all channel gains are equal to one). However, this 
technique does not establish a connection between messages 
drawn from a finite field and lattice codewords. Part of the 
motivation for this paper is to unify the techniques from [1] 
and [12] into a single framework. 

The primary contribution of this paper is an expanded 
compute-and-forward framework that permits both unequal 
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powers and noise tolerances across transmitters while provid¬ 
ing a mapping between finite field messages and lattice code¬ 
words. Interestingly, this framework allows us to interpret both 
the power constraint and noise tolerance associated to each 
message in terms of “signal levels,” in a manner reminiscent of 
the deterministic model of Avestimehr et al. [13]. Specifically, 
each transmitter’s message is a vector from where p 
is prime. The power level of the transmitter determines a 
“ceiling” above which the message vector must be zero. 
Similarly, the noise tolerance of the transmitter determines a 
“floor” below which the message vector must be zero. The 
information symbols of the transmitter are placed between 
these constraints. 

Recent work has studied the problem of recovering multiple 
linear combinations at a single receiver. In particular, Feng et 
al. [14] linked this problem to the shortest independent vector 
problem [15] and a sequence of papers has demonstrated 
its value for integer-forcing MIMO decoding [3], [6]-[8] as 
well as for integer-forcing interference alignment [9]—[11]. 
However, the original compute-and-forward framework does 
not capture some of the subtleties that arise when decod¬ 
ing multiple linear combinations. For instance, as shown by 
Ordentlich et al. [9], after one or more linear combinations 
have been decoded, they can be used as side information to 
eliminate some of the codewords from subsequent linear com¬ 
binations. This algebraic successive cancellation technique 
eliminates some of the rate constraints placed on codewords, 
i.e., it enlarges the rate region. Also, as shown in [9], this 
technique can be used to approach the multiple-access sum 
capacity within a constant gap. Additionally, recent work by 
the first author [16] as well as Ordentlich et al. [7] revealed 
that decoded linear combinations can be used to infer the 
corresponding integer-linear combination of channel inputs, 
which can in turn be used to reduce the effective noise 
encountered in subsequent decoding steps. As argued in [7], 
this successive computation technique can reach the exact 
multiple-access sum capacity. 

Our expanded compute-and-forward framework is designed 
with multiple linear combinations in mind. Specifically, we 
use a computation rate region to capture the dependencies 
between rate constraints. Our achievability results broaden the 
algebraic successive cancellation and successive computation 
techniques to permit unequal powers as well as scenarios 
where the number of messages exceeds the number of desired 
linear combinations. We capture the prior results of [3], [7], 
[9], [16] as special cases and shed additional light on the 
structure of optimal integer matrices for successive decoding. 

Beyond unifying existing results, this expanded framework 
is meant to serve as a foundation for ongoing and future 
applications of compute-and-forward and integer-forcing tech¬ 
niques. For example, the initial motivation behind developing 
this framework was the authors’ exploration of integer-forcing 
for interference alignment [10]. Subsequently, He et al. [17] 
have used this framework to propose a notion of uplink- 
downlink duality for integer-forcing and Lim et al. [18] used 
it to propose a discrete memoryless version of compute-and- 
forward. 


A. Related Work 

The main concept underlying compute-and-forward is that 
the superposition property of the wireless medium can be 
exploited for network coding [19]-[21], This phenomenon was 
independently and concurrently discovered by [22]-[24], with 
the latter coining the phrase physical-layer network coding. 
Subsequent efforts [12], [25], [26] developed lattice coding 
strategies for communicating the sum of messages to a single 
receiver. This lead to the compute-and-forward framework [1] 
for multiple receivers that recover linear combinations of the 
messages (albeit with equal power constraints, unlike the 
single receiver framework of [12]). 

As shown by Feng et al. [14], any compute-and-forward 
scheme based on nested lattice codes can be connected to 
network coding over a finite commutative ring. From this 
algebraic perspective, the compute-and-forward framework 
of [1] can be viewed as a special case that connects nested 
lattice codes generated via Construction A to network coding 
over a prime-sized finite field. Another important special case 
is the recent work of Tunali et al. [27] that develops a compute- 
and-forward scheme based on nested lattices over Eisenstein 
integers. For complex-valued channels, this scheme can offer 
higher computation rates on average (e.g., for Rayleigh fading) 
since the Eisenstein integers are a better covering of the 
complex plane than the Gaussian integers employed by [1], 
Several recent papers have also used the algebraic perspective 
of [14] to propose practical codes and constellations for 
compute-and-forward [28]— [3 1], 

The line of work on compute-and-forward is part of a 
broader program aimed at uncovering the role of algebraic 
structure in network information theory, inspired by the paper 
of Korner and Marton [32]. For instance, there are advantages 
for using algebraic structure in coding schemes for dirty 
multiple-access [33]-[35], distributed source coding [36]-[40], 
relaying [2], [5], [41]—[44], interference alignment [9], [10], 
[45]-[49], and physical-layer secrecy [50]—[53] . Many of these 
works benefit from the development of lattice codes that are 
good for source and channel coding as well as binning [54]- 
[59], We refer readers to the textbook of Zamir [60] for a 
full history of these developments, an in-depth look at lattice 
constructions, achievable rates, and applications as well as a 
chapter [61] on the use of lattice codes in network information 
theory. 

In recent, independent work, Zhu and Gastpar [62] proposed 
a compute-and-forward scheme for unequal powers based on 
the scheme of [12]. They also showed how to use this scheme 
to reach the two-user multiple-access sum capacity (if the 
channel strength lies above a small constant). However, their 
scheme does not retain the connection to a finite field. 

The original motivation for the compute-and-forward strat¬ 
egy was the possibility of relaying in a multi-hop network 
while avoiding the harmful effects of interference between 
users (by decoding linear combinations) as well as noise 
accumulation (by decoding at every relay). Several works 
have investigated and improved upon the performance of the 
original compute-and-forward framework in the context of 
multi-hop relaying [41], [43], [48], [63]—[65]. Our expanded 
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framework can improve performance further by permitting 
relays to employ unequal powers and decode multiple linear 
combinations when appropriate. 1 

B. Paper Organization 

We have strived to present our results, some of which are 
rather technical, in an accessible fashion. To this end, we begin 
in Section II with an informal overview of our framework to 
build intuition, before giving a formal problem statement. We 
then state our main results in Section III without using any 
lattice definitions or properties. Afterwards, we introduce our 
nested lattice code construction in Section IV and proceed to 
prove our main achievability theorems in Sections V and VI. 

II. Problem Statement 

In this section, we provide a problem statement for the 
expanded compute-and-forward framework. As mentioned ear¬ 
lier, our message structure can be interpreted in terms of 
signal levels that resemble the deterministic model of Aves- 
timehr et al. [13 ]. 2 Unlike the original compute-and-forward 
problem of [1], we will not aim to directly decode linear 
combinations of the messages. Instead, we associate each 
message realization with a coset and aim to decode linear 
combinations of vectors that belong to the same cosets as 
the transmitted messages. We describe a simple method of 
understanding this class of linear combinations through the use 
of “don’t care” entries. As we will argue, if the coefficients 
of the linear combinations would suffice to recover a given 
subset of the messages in the original compute-and-forward 
framework [1], they also suffice to recover these messages in 
the expanded compute-and-forward framework. For example, 
if the receiver obtains a full-rank set of linear combinations, 
all of the transmitted messages can be recovered successfully. 

For the sake of conciseness, we will focus on real-valued 
channel models with additive white Gaussian noise (AWGN). 
Our coding theorems can be applied to complex-valued chan¬ 
nel models either via a real-valued decomposition of the chan¬ 
nel [1], [3] or by building nested lattice codebooks directly 
over the complex field using either Gaussian or Eisenstein 
integers [14], [68], While our framework is intended for 
AWGN networks with any number of sources, relays, and 
destinations, we find it clearer to first state our main results 
from the perspective of a single receiver that wishes to decode 
one or more linear combinations. In Section III-D, we will 
show how to apply our coding theorems to scenarios with 

'Following the conference publication of this work. Tan et al. [66] noted 
that our proposed message representation may be inefficient for multi-hop 
relaying if each relay naively treats a linear combination over Zp as k 
information symbols for the next hop. They proposed a lattice-based solution 
to this issue. It is also possible to resolve this issue directly over the message 
representation by having each relay only use some of its k received symbols 
as information symbols for the next hop. See [67, Section III.F] for details. 

2 Unlike [13], we do not propose a deterministic model for analyzing 
communication networks. Instead, here, we use a deterministic model as an 
expository tool to explain the decoding requirements of our problem statement. 

3 For the case of Eisenstein integers, our lattice achievability proof from 
Theorem 8 will need to be generalized following the approach of Huang et 
al. [27], 


multiple receivers. Below, we state our notational conven¬ 
tions, essential definitions for our channel model, a high-level 
overview of the compute-and-forward problem, and a formal 
problem statement. 

A. Notation 

We will employ the following notation. Lowercase, bold 
font (e.g., x) will be used to denote column vectors and 
uppercase, bold font (e.g., H) will be used to denote matrices. 
For any matrix H, we denote the transpose by H T , the span 
of its rows by rowspan(H), and its rank by rank(H). The 
notation ||x|| denotes the Euclidean norm of the vector x while 
A m in(H) and A max (H) denote the minimum and maximum 
singular values of H, respectively. We will denote the all-zeros 
column vector of length k by 0/., the k x n all-zeros matrix 
by Ofc Xn , the all-ones column vector of length k by l/ i; , and 
the k x k identity matrix by I/.. We will sometimes drop the 
subscript when the size can be inferred from the context. The 
log operation will always be taken with respect to base 2 and 
we define log + (cc) = max(0, log(:r)). 

Our framework will make frequent use of operations over 
both the real field R and the finite field consisting of the 
integers modulo p , Z p = {0,1,... ,p — 1}, where p is prime. 4 
Addition and summation over R will be denoted by + and 
Y* respectively. Similarly, addition and summation over the 
finite field Z p where p is prime will be denoted by ® and 
0, respectively. We will write the modulo-/; reduction of an 
integer a £ Z as [a] mod p = r where r £ Z p is the unique 
element satisfying a = qp+r for some integer q. It will also be 
convenient to write the elementwise modulo-/; reduction of an 
integer vector a £ Z L and an integer matrix A £ Z J ' /xL as 
[a] mod p and [A] mod p, respectively. Recall that addition 
and multiplication over Z p are equivalent to addition and 
multiplication over R followed by a modulo-p reduction, i.e., 

qiWi © q 2 w 2 = [qiWi + q 2 w 2 ] mod p 

where qi 1 q 2 ,w\,w 2 £ Z p . Note that on the right-hand side of 
the equation above, we have implicitly viewed q\ 1 q 2 , wi,w 2 as 
the corresponding elements of Z (under the natural mapping) 
in order to evaluate the real addition and multiplication. This 
will be the case throughout the paper: whenever elements of 
Z p appear as part of operations over the reals, they will be 
implicitly viewed as the corresponding elements of Z. 

B. Channel Model 

Consider L single-antenna transmitters that communicate 
to a receiver over a Gaussian multiple-access channel. See 
Figure 1 for an illustration. Each transmitter (indexed by 

4 Historically, the set of integers modulo p has been denoted by Z/pZ, 
Z/(p), or Z p . Some mathematicians prefer to avoid the notation Z p because 
it can be confused with the set of p-adic integers if p is a prime number. Here, 
we will use the notation Z p for the sake of conciseness, especially since we 
will frequently refer to vector spaces of the form Z^ (and have no need to 
refer to p-adic integers). 
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£ = 1,2,..., Z/) produces a length-n channel input E M n 
subject to the power constraint 5 

E||x f || 2 < nP< (1) 

where Pp > 0. 



Fig. 1. Block diagram for the compute-and-forward problem with a single 
receiver. Each transmitter has a message whose elements are taken from 
Z p . This message is embedded into Z^ (by zero-padding), mapped into a 
codeword E M n , and sent over the channel. The receiver observes a noisy 
lineal' combination of these codewords, Y = Yle. h^xj + Z and attempts to 
recover the linear combinations ui, 112 ,..., of the coset representatives 
of the original messages. 


The receiver has N v antennas and observes an N r x n di¬ 
mensional channel output Y that is a noisy linear combination 
of the inputs: 

L 

Y = ]T h/xj + Z 

t=i 

where £ HA' is the channel vector between the £th 
transmitter and the receiver and Z £ M ;Y|Xn is elementwise 
i.i.d. Af(0, 1). It will often be convenient to group the channel 
vectors into a channel matrix 

H — [hi h 2 ••• h L ] , 

and concisely write the channel output as 

Y = HX + Z (2) 

where X = [xi • ■ • x/,] T is the matrix of channel inputs. We 
will assume throughout that the channel matrix H is known 
to the receiver and unknown to the transmitters. However, the 
transmitters may assume that the maximum singular value 
Amax(H) of the channel matrix is upper bounded by a con¬ 
stant. 6 

'In [ 1 , Appendix C], it is argued that, for symmetric compute-and-forward, 
the expected power constraint E[||x/|| 2 ] < nP can be replaced with a 
hard power constraint ||x|| 2 < nP without affecting the achievable rates. A 
similar argument should apply in our setting by first refining the nested lattice 
existence proof in [59, Theorem 2] to show that the coarse lattices are also 
good for covering [58], Alternatively, each encoder can throw out a constant 
fraction of its codebook to obtain a subcodebook that satisfies a hard power 
constraint while still maintaining the same achievable rate asymptotically. 

6 This assumption will be used to make a connection between the solvability 
of the linear combinations over and the rank of the integer matrix A over 
R. If we further assume that the channel matrix is generated randomly and the 
receiver is able to tolerate some probability of outage, then this condition can 
by replaced by the milder condition that P(A ma x(H) > 7 ) —> 0 as 7 —» 00 . 
See [1, Remark 10] for further details. 


C. High-Level Overview 

We now provide a high-level, informal overview of our 
compute-and-forward framework, which will help build intu¬ 
ition for the formal problem statement to follow. We begin 
by summarizing the original compute-and-forward framework 
from [1] and its MIMO generalization from [3], We then 
discuss how to incorporate unequal power constraints and 
recovering multiple linear combinations into an expanded 
framework. 

Compute-and-Forward with Equal Powers: The /th trans¬ 
mitter’s message is a length-/,-/ vector wp whose elements are 
from Jj p where p is prime. Each message is zero-padded to a 
common length 


where k = max/ kp, and mapped to a length-n real-valued 
codeword X/ that satisfies the symmetric power constraint 
E[||x £ || 2 ] < nP. The rate Rp associated with a transmitter 
is the number of bits in its message normalized by the 
length of the codeword, Rp = ( kp/n ) logp. Given coefficients 
<2T, 52 , • ■ ■, <1l £ Zip, the receiver’s goal is to recover a linear 
combination u of the (zero-padded) messages. 


L 

u = Q)qpwp. 


As argued in [1], the main idea underlying compute-and- 
forward is to establish a connection between linear combina¬ 
tions of the messages and integer-linear combinations of the 
codewords, in order to exploit the noisy linear combination 
taken by the channel. For instance, after applying an equaliza¬ 
tion vector b £ M Vr , the channel output can be expressed as an 
integer-linear combination of the codewords 7 with coefficients 
ai, d 2 ,..., az, £ Z plus effective noise, 

L L 

b T Y = ^ apxj + ^ (b T h/ — ap)xp + b T Z . (3) 

1 =1 e=i 

v--- 

effective noise 


Each integer-linear combination of codewords is associated 
with a linear combination of the messages with coefficients 
qt = [ci^] mod p. The performance of a compute-and-forward 
scheme is given by a computation rate region, which is 
specified by a function f? com p(H,a) that maps each channel 
matrix H and integer coefficient vector a = [ai a 2 ■ • • o,/,] T 
to a rate. Specifically, if the rates associated to messages with 
non-zero coefficients are less than the computation rate 


max Rp <C /(comp(PI- a) . 

£:ci£^0 

then the linear combination with coefficients qp = [ap] mod p 
is decodable with vanishing probability of error (with respect 
to the blocklength n). Operationally, this means that the 
scheme works in the absence of channel state information 
at the transmitter (CSIT) and that the receiver is free to 


7 To be precise, our coding scheme employs dithered lattice codewords as 
the channel inputs xp. However, the dithers can be removed at the receiver 
prior to decoding, and are thus ignored in this high-level overview. 
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choose which linear combination to decode, among those 
satisfying the computation rate constraint. Owing to this form 
of universality, compute-and-forward is applicable to scenarios 
with multiple receivers, each facing a different channel matrix 
and aiming to decode its own linear combination. 

It shown in [1, Theorem 1] and [3, Theorem 3] that the 
computation rate region described by 


-Rcomp(H,a) — ^ log 



p 

(P -1 I + H T H) 



( 4 ) 


is achievable. The achievability proof utilizes nested lattice 
codebooks, which guarantees that any integer-linear combi¬ 
nation of codewords is itself a codeword and thus afforded 
protection from noise. In particular, each transmitter’s code¬ 
book is constructed using a fine lattice with effective noise 
tolerance cr 2 fr /: and a common coarse lattice that enforces the 
power constraint P. These nested lattices are chosen such that 
the £th transmitter’s rate R p converges to 4 log + (P/<r 2 ff p ) 
asymptotically in the blocklength n. The nested lattice con¬ 
struction sends the field size p to infinity with the blocklength 
n, in order to produce Gaussian-like channel inputs and obtain 
closed-form rate expressions. 

Remark 1: If the field size is held fixed, we encounter 
similar issues as seen when evaluating the capacity of a point- 
to-point Gaussian channel under a finite input alphabet, i.e., 
we do not obtain closed-form rate expressions. For practical 
point-to-point codes, a common approach is to pick a finite 
constellation size based on the SNR [69] and accept a small 
rate loss. A similar approach enables practical codes for 
compute-and-forward [14], [31], 0 

Remark 2: Since the field size p changes with the block- 
length n, it does not make sense to specify the desired linear 
combinations via fixed coefficients q p £ Z p . Instead, we fix 
desired integer coefficients at and specify the desired linear 
combinations as those with coefficients satisfying q rn p = 
K,<] mod p. 0 

The effective noise tolerance of the codeword associated 
to Yle «t x / : is determined by the minimum noise tolerance 
over all participating fine lattices, min^ :a£5 £o cr 2 ff Roughly 
speaking, an integer-linear combination is decodable if the 
variance of the effective noise in (3) is less than its effective 
noise tolerance. It can be shown that the denominator in (4) 
corresponds to the variance of the effective noise when b is 
chosen as the minimum mean-squared error (MMSE) projec¬ 
tion. 

Compute-and-Forward with Unequal Powers: In this paper, 
we expand the original compute-and-forward framework [ 1 ] in 
two aspects. First, we allow for an unequal power allocation 
across transmitters. Second, we explicitly consider the scenario 
where the receiver may wish to recover more than one linear 
combination. Decoding more than one linear combination 
appears in many contexts, such as recovering the L transmitted 
messages in an integer-forcing MIMO receiver [3], relaying 
in a network where there are more transmitters than relays, 
and integer-forcing interference alignment [10]. In order to 
incorporate these two generalizations, we will expand the 
definition of the computation rate region. Here, we provide 


an intuitive description of our modifications before presenting 
a formal problem statement. 

We first describe our modification to the message structure. 
To each transmitter, we associate a power constraint Pg and, as 
before, an effective noise tolerance p . In the equal power 
setting, the rates varied across transmitters due only to the 
change in the effective noise tolerance. To cope with the 
fact that the messages have different lengths, they are zero- 
padded prior to taking linear combinations. Here, the rates 
will vary due to both changes in power and effective noise 
tolerance, for which zero-padding will not suffice. Instead, we 
take inspiration from the idea of signal levels as introduced 
in [13], 

The length of the fth transmitter’s message is a length- 
(k-p.i — kc,e) vector w/ whose elements are drawn from Z p 
where p is prime and the parameters kcj:- k F p £ N will be 
determined by the power constraint Pg and effective noise 
tolerance er 2 ff respectively. Define kp = min/ kpt and 
kp = max/ kpj : . The total number of available signal levels 
is k = k F - fe c . Each message is embedded into Z p using 
kc,e — kp leading zeros and k F — k F j trailing zeros. 


Ofcc,/—fee 

W/ 

Ofc F —kpp 


( 5 ) 


and mapped to a length-n real-valued codeword x/ that satis¬ 
fies the power constraint E[||x/|| 2 ] < nPi. The rate R p asso¬ 
ciated with this transmitter is the number of message bits nor¬ 
malized by the codeword length, Ri = ((k F / — kc,i)/n) logp. 


© 

r 

0 


© 

0 

w 2 < 

0 


© 

0 

l 

0 

r 

0 

0 


• 

w 3 ^ 

0 

0 


• 

0 

• 


• 

i 

0 


[wi] [w 2 ] [w 3 ] 


Fig. 2. Illustration of message cosets for L = 3 transmitters. The parameters 
are /cp,i = 7, fcp ,2 = 5, &f ,3 — 8 , &c,i = 3, fcc ,2 — 2, and /cc ,3 = 4. 
Therefore, kp = 8 , kc = 2, and k = 6 . We use the symbol * to stand for a 
“don’t care” entry that can take any value in Z p , lightly shaded (blue) circles 
to denote information symbols (i.e., elements of w^) that can take values in 
Z p , and black circles to denote zeros. Based on the parameters, the coset 
[wi] associated with the first transmitter has 1 “don’t care” entry, then 4 
information symbols, and then 1 zero. Similarly, the coset [W 2 ] associated 
with the second transmitter has 3 information symbols followed by 3 zeros. 
Finally, the coset [W 3 J associated with the third transmitter has 2 “don’t care” 
entries followed by 4 information symbols. The receiver’s goal is to recover 
a linear combination of vectors drawn from these cosets. 


As mentioned above, the receiver may wish to decode 
more than one linear combination. We compactly represent 
the receiver’s demands through L desired linear combinations 
ui, U 2 ,..., u l of the form 

L 

u m = (£) q m , <W{ 

1 =1 

where q m> g S Z p and w/ is an element of a certain coset 
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[wfj with respect to the message w^. 8 Specifically, the coset 
consists of all vectors in for which the first kc,i ~ kc 
elements can take any values in Z p (and can be viewed as 
“don’t care” entries), the next kpj : — kc.f elements contain the 
message W|, and the remaining At — kpj entries are equal to 
zero. 

It is convenient to group the coefficients into a matrix 
Q = {<7rri. t}- If the receiver wants fewer than L linear 
combinations, then it can set the entries of the unneeded 
rows of Q to zero. We have illustrated an example message 
structure in Figure 2. Note that this framework includes 
the original problem statement as a special case by setting 
^’c,i = kc,2 = ... = fcc.L- 

We have relaxed the decoding requirements by allowing the 
receiver to decode linear combinations of vectors drawn from 
the same cosets as the message vectors. However, this does not 
affect the algebraic conditions for recovering messages from 
their linear combinations. Specifically, if the coefficient matrix 
Q enables the receiver to recover w^, it can also immediately 
recover wg. For example, if Q is full rank, the receiver can 
recover all L transmitted messages. 

Our coding scheme will employ nested lattice codes in order 
to link linear combinations of the messages to integer-linear 
combinations of the codewords, just as in the symmetric case. 
We will select kc,e using the transmitter’s power constraint 
P{ and kt j- using the effective noise tolerance cr) fr so that 
the Hh transmitter’s rate Rg converges to i log + (Pf/cr? ff ^) 
asymptotically in the blocklength n. As before, the field size 
p tends to infinity with the blocklength n. Thus, we select 
desired integer coefficients a m ,t € Z and specify the de¬ 
sired linear combinations as those with coefficients satisfying 
q m ,e = [a m ,e] mod p. 

The channel output can be written as an integer-linear 
combination of the codewords plus effective noise as in (3). 
We consider L such integer-linear combinations and collect 
the desired coefficients into an integer coefficient matrix 
A = {a m /}, which in turn specifies the coefficient matrix 
as Q = [A] mod p where the modulo operation is taken 
elementwise. 

We now describe our modification to the computation rate 
region definition from [1], Unlike the equal power setting, 
the rate region cannot be described using a single compu¬ 
tation rate. For example, even if we are interested in only 
recovering a single linear combination at the receiver, the 
computation rate for each transmitter will still be determined 
by its own power combined with the effective noise for the 
linear combination. As another example, consider an equal 
power setting where the receiver wishes to decode more than 
one linear combination. Once it has decoded a single linear 
combination, it can use it as side information to help decode 
the next, meaning that the rate constraints for the linear 
combinations should be considered jointly. To capture such 
phenomena, we characterize the performance of a compute- 
and-forward scheme via a set-valued computation rate function 
7?. COI n P (H. A) that maps each channel matrix H and integer 

8 To be precise, the linear combinations are affine varieties (i.e., translates 
of a vector subspace). However, we will simply refer to these as linear 
combinations throughout the paper. 


r 2 





^-comp(H3 

) A3) 



T^-comp (H2, A 

2 ) 




T^-comp (Hi, Ai) 

• 




Ri 

Fig. 3. Sample evaluations of the computation rate region for L = 2 
transmitters. The dot denotes the rate tuple for the two transmitters. Since 
this rate tuple falls inside the (red) rate region 7£ C omp(H2, A2), the receiver 
can recover the linear combinations with integer coefficient matrix A2 under 
channel matrix H2 . Similarly, the linear combinations with integer coefficient 
matrix A3 can be recovered under channel matrix H3 since the rate tuple falls 
inside the (green) rate region 7^. C omp(H3, A3). On the other hand, since the 
rate tuple falls outside the (blue) rate region 7 ^. C omp(Hi, Ai), the receiver is 
not required to recover the linear combinations with integer coefficient matrix 
Ai under channel matrix Hi. 

coefficient matrix A to a subset of R+. This subset consists 
of all rate tuples that are achievable for the specified H and 
A under the chosen coding scheme. That is, if the rate tuple 
associated to the messages falls inside the computation rate 
region, 

(f?l, f?2, • ■ ■ ) Rl) £ 7^comp(H, A) , 

then the linear combinations with coefficient matrix Q = 
[A] mod p are decodable with vanishing probability of error 
(with respect to the blocklength n). See Figure 3 for an 
illustration explaining the computation rate region. 

D. Formal Problem Statement 

We now provide a formal problem statement. A coding 
scheme is parametrized by the following: 

• A positive integer n denoting coding blocklength, 

« a positive prime number p denoting the size of the finite 
field Z p over which the linear combinations are taken, 
and 

• non-negative integers kc,g,kp t g for £ G {1,2, ...,£} 
satisfying kc,g < kp/ < n, where kp t g — kc/ can be 
interpreted as the number of available signal levels at the 
Mi transmitter. 

Definition 1 (Messages): For l = 1, 2,..., L, the Mi trans¬ 
mitter has a message wg that is drawn independently and 
uniformly over Z p F f c,e . The rate of the Mi message (in 
bits per channel use) is 

kp e — kc,e , 

—-- logp • 

n 

0 

Definition 2 (Encoders): For l = 1,2,..., L, the Mi trans¬ 
mitter is equipped with an encoder Eg : Z p F,f C ' 1 —> R" that 
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maps its message into a channel input vector = £((’Wp) 
subject to the power constraint (1). 0 

Definition3 (Decoder): Define kc — min^ fcc,r, kv = 
maxf kv/, and k = k? — kc- Also, define the coset 



r 

r 

w,J4 

w G Zp : w = 





for some r 


G Zp c ’ e ~ kc 


( 6 ) 

The receiver is equipped with a decoder V : R JV,X " x M. N,xL x 
Z LxL -a Z kxk that takes as inputs the channel observation Y 
from (2), the channel matrix H, and the desired integer coef¬ 
ficient matrix A, and outputs an estimate U = D(Y. H. A), 
Let denote the mth row of U. We say that decoding is 
successful if Ui = ui, U 2 = 112 ..... u/\ = u;, for some linear 


combinations of the form 


L 

1^=0 

t=l 


where the q m G Z p are the entries of Q = [A] mod p and 
wp G [w/]. We say that the decoder makes an error if it is 
not successful. We sometimes refer to U], u a ,..., Ul as linear 
combinations with integer coefficient matrix A. <) 

Definition 4 (Computation Rate Region): A computation 
rate region is specified by a set-valued function 7^. comp (H, A) 
that maps each channel matrix H G S> NrXL and integer 
coefficient matrix A G Z LxL to a subset of K^. 
The computation rate region described by a set-valued 
function TZ com p is achievable if, for every rate tuple 
(i?i, i? 2 > • • • j Rl) G R+, e > 0, and n large enough, there 
exist 


« parameters p, fcc.G ^'fr satisfying 
Rp — e for £ = 1,2,..., L , 

• encoders £ 1 ,£ 2 , ...,£l, 


ky,p — kc ,1 
n 


log p 


> 


such that. 


for every channel matrix H G R A7 ' ,X L and 

every integer matrix A, satisfying (Ri, R 2 , ■ ■ ■, Rl) £ 

TZcomp (H, A), 


there exists a decoder V with probability of decoding error at 
most e. 0 

Remark 3: The usual approach to defining a rate region 
is to first define the notion of an achievable rate tuple, and 
then define the rate region as the set of all achievable rate 
tuples. Our definition does not have this structure because the 
encoders £ 1 , £ 2 ,... ,£l are assumed to be ignorant of both 
the channel matrix H and the integer coefficient matrix A. 
Thus, a rate tuple (Ri, R 2 , ■ • ■, Rl) selected by the encoders 
will not lead to successful decoding for all (H, A) pairs. 
Instead, we characterize the rate region via the set-valued 
function 7?, comp (H, A), which specifies the rate tuples that lead 
to successful decoding for each (H, A) pair. 0 

Remark 4: In some cases, it may be possible to simplify the 
framework by setting all of the “don’t care” entries to zero, 
i.e., setting r = 0 in (6). For example, if the encoders do 
not dither their lattice codewords prior to transmission in the 
proof of Theorem 1, then it follows from Definition 11 that 
this is possible. However, this will significantly complicate the 


proof, since the effective noise will not be independent from 
the desired linear combination. More generally, this problem 
statement is directly applicable for compute-and-forward over 
discrete memoryless networks [18]. In that setting, the “don’t 
care” entries play an important role for selecting codewords 
with the desired type, following the joint typicality encoding 
approach of Padakandla and Pradhan [70], 0 

III. Main Results 

In this section, we state our main coding theorems as well 
as provide intuitions and examples. Although the proofs of our 
achievability results rely on the existence of good nested lattice 
codes, we have deferred (nearly all) discussion of lattices 
to subsequent sections in order to make our main results 
more accessible. Our primary technical contribution is the 
generalization of the compute-and-forward framework to allow 
for unequal powers, multiple receive antennas, and recovering 
more than one linear combination (at a single receiver), 
all while maintaining a connection to Z k . We demonstrate 
the utility of our generalization of the compute-and-forward 
framework by applying it to the classical and compound 
Gaussian multiple-access channels. 

Note that all of our results implicitly assume that the 
transmitters do not have access to channel state information. 
It is well-known that, with channel state information, the 
transmitters can steer the channel gains towards integer values, 
which can improve the end-to-end rates [41], [71]. This can be 
captured within our framework by multiplying each channel 
gain by a scalar (chosen using channel state information). 
More generally, the achievable rates of linear beamforming 
strategies for multi-antenna transmitters can be captured within 
our framework by multiplying each transmitter’s channel 
vector (on the right) by its beamforming vector. See [10] 
for an application to interference alignment and [17] for an 
application to uplink-downlink duality. 

We now provide a high-level summary of our results: 

« Parallel Computation: Theorem 1 expands the compu¬ 
tation rate region from [1] by permitting unequal power 
allocation across the transmitters. Each of the desired 
linear combinations is decoded independently of the 
others, which we refer to as “parallel computation.” 

• Successive Computation: Theorem 2 enlarges the com¬ 
putation rate region from Theorem 1 by decoding the lin¬ 
ear combinations one-by-one and employing successive 
cancellation. This “successive computation” technique 
can be viewed as a generalization of [7], [16] to the 
unequal power setting. Theorem 3 shows that it suffices 
to use “primitive” integer coefficient matrices, i.e., integer 
matrices with a unimodular completion. 

• Multiple-Access Sum Capacity within a Constant 
Gap: Theorem 4 shows that the sum of the L highest 
parallel computation rates always lies within L bits of the 
sum capacity of the underlying multiple-access channel. 
Furthermore, these computation rates can mapped to 
individual users, which leads to an operational interpre¬ 
tation as a multiple-access strategy. An implication of 
the theorem is that the parallel computation strategy. 
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when combined with algebraic successive cancellation, 
is approximately optimal for multiple access. From one 
perspective. Theorem 4 generalizes the compute-and- 
forward transform of [9] to the unequal power setting. 

« Multiple-Access Sum Capacity: Theorem 5 shows that, 
for any unimodular integer coefficient matrix, the sum of 
the L successive computation rates is exactly equal to the 
sum capacity of the underlying multiple-access channel. 
Under certain technical conditions, this can be employed 
as an optimal multiple-access strategy. Theorem 5 gener¬ 
alizes the successive integer-forcing scheme of [7] to the 
unequal power setting. 

• Multiple Receivers: Theorems 6 and 7 give achievable 
rate regions for multiple receivers for parallel and suc¬ 
cessive computation, respectively. 

We now introduce some additional notation. Define P to be 
the diagonal matrix of the power constraints, 

P = diag(P 4 , P 2 ,..., Pl) , 

and let F £ R ixi be any matrix that satisfies 

F T F = (P ^ 1 +H T H ) _1 . (7) 

Note that F is not unique and can determined via several 
approaches, such as via its eigendecomposition or its Cholesky 
decomposition. 

Recall that A £ Z ixi is the desired integer coefficient 
matrix. Let aj n denote the mth row of A and o r „ j denote the 
(to, £)th entry. We will sometimes refer to u m as the linear 
combination with integer coefficient vector a m . In certain 
scenarios (e.g., relaying, interference alignment), the receiver 
may wish to decode M < L linear combinations. This can 
be explicitly represented in our framework by setting the last 
M — L rows of A to be zero but it will be convenient to 
develop more compact notation. Let A be an M x L integer 
matrix with M < L. We will implicitly take P C omp(H, A) to 
mean P. C omp(H, A) where 



We also recall the following basic result from linear algebra. 
Lemma 1 (Woodbury Matrix Identity): For any 

(appropriately-sized) matrices Mp, M2, M3, and M4, 
we have that 

(Mi + M 2 M 3 M 4 ) _1 

= Mf 1 - M^Ma (M3 1 + M 4 MC 1 M 2 ) _1 M 4 M^ 

□ 

See, e.g., [72, Theorem 18.2.8] for a proof. As an example, 
take M 1 = P 1 , M 2 = H T , M 3 = I, and M 4 = H. It then 
follows from the Woodbury Identity that 

(P 1 + H t H) 1 = P PH t (I + HPH t ) _1 HP . (8) 

We will make frequent use of this identity. 


A. Parallel Computation 

We begin with a “parallel computation” strategy, in which 
the receiver decodes each of the desired linear combinations 
independently. To recover the mth linear combination, the 
receiver applies an equalization vector b m £ R ;V| to its 
observation to obtain the effective channel 

y m = bJ„Y 

= b^HX + bJ n Z 

= a^X + (bJ n U- a T m )X + b T m Z , 

S --- / 

effective noise 

and then decodes to the closest lattice codeword. In Section V, 
we will argue that, if the fine lattices associated with a^ X can 
tolerate an effective noise variance of 

°pa ra (H, a m ,b TO ) 4 iE||(bTH-aT)X + bTzf 

= ||b m || 2 +||(bTH- a T)pl/ 2|| 2 , (9) 

then the linear combination with integer coefficient vector a m 
can be successfully decoded. 

Lemma 2: The equalization vector b m £ R ;V| that min¬ 
imizes the effective noise variance from (9) is the MMSE 
projection vector 

b 0 T ptim =aT i PH T (l + HPH T )- 1 . 

The minimal effective noise variance is 

^paral^-? a ra) = cr para(-^-’ bopt,ra) (10) 

= a T(p- 1 +H T H)- 1 a m 

= ||Fa m || 2 , (11) 

where F is any matrix that satisfies (7). □ 

See Appendix A for a proof. 

The users that participate in the integer-linear combination 
X are simply those with non-zero integer coefficients, 
a m ,i 7 ^ 0. These users should satisfy the rate constraints 
Ri < l/21og + (Pf/cr 2 ara (H, a m )) in order for the receiver to 
directly decode the linear combination. Note that the receiver 
can also decode linear combinations indirectly. Specifically, 
it can decode any integer-linear combinations whose integer 
coefficient matrix A £ Z LxL has the same rowspan as A, 
and then simply solve for AX from AX. Therefore, the 
achievable computation rate region involves a union over all 
integer matrices with the same rowspan, as captured by the 
following theorem. 

Theorem 1: For an AWGN network with L transmitters, 
a receiver, and power constraints Pi,...,Pl, the following 
computation rate region is achievable, 

7^(H,A)= (J7WH.A) 

Aez ix - L 

rowspan(A)Crowspan(A) 

where 

7^ para (H, A) 4 J : 
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Re < 2 lo g 


Pe 


a(H, a m ) 


V(?7t,£) S.t. Qm,e 7 ^ 0 


and and d m ,e are the mth row and (m, £)th entry of A, 
respectively. □ 

The achievability proof is presented in Section V. 

Remark 5: The computation rate region described in Theo¬ 
rem 1, when restricted to the special case of equal powers and 
a single antenna at the receiver, yields the rate region from [1, 
Theorem l]. 9 0 

The following lemma restricts the search space for integer 
vectors. 

Lemma 3: Let A ma x(l + PH t H) denote the maximum 
eigenvalue of I + PH T H. Consider an integer matrix A £ 
Z LxL and user index £ £ {1,2If, for some m £ 
{1,2,..., L}, the (to, £)th entry of A satisfies 

a m,e A A max (l + PH t H) , 

then R( = 0 for any rate tuple (f? 1; ..., R L ) £ 7^ para (H, A). 

□ 

The proof is deferred to Appendix B. 

Example 1: Consider a receiver that observes Y = A'i + 

• • • + Xl + Z and wants to decode the linear combination with 
integer coefficient vector ai = 1 (i.e., the sum of the messages 
over Z p ). (For L — 2, this corresponds to the multiple-access 
phase in a Gaussian two-way relay channel as studied by [26], 
[73].) Using (8), we get 

4, ra (l T 1 T ) = 1 T P1 - 1 T P1(1 + l T Pl) VPI 

Eti Pi 

i + Eti Pi ' 

The resulting rate region is 
^ para (l T ,l T ) 

= ..., e K+ : Re < - log + ^ 1 p j 

and is equal to that derived by Nam, Chung, and Lee [12] for 
decoding the sum of lattice codewords over a multiple-access 
channel with unequal powers. 

It is well-known that this region can be expanded at low 
SNR by decoding the messages individually and then com¬ 
puting the sum. That is, the rowspace of A = I contains that 
of A and yields the rate region 


^ paia (l T ,I) 

GR+ : Re < \ log (l + „ ^ 

The computation rate region can be further expanded by taking 
the union over all viable A, e.g., by first decoding sums of 
subsets of the messages and then combining these. 0 




technically speaking, our expression of the achievable rate region slightly 
generalizes the region described in [1] since we explicitly take a union over 
the set of integer matrices A that contain the rowspan of A. This possibility 
is discussed in [ 1, Remark 7] but not formally included in the statement of [ 1 , 
Theorem 1]. 


Example 2: Consider a receiver that wishes to recover all 
of the messages, A = I. Let denote the mth column of I. 
Using ( 8 ), the effective noise variances are 

°"p ara (H, 6 m ) =S T m (P - PH T (I + HPH t ) _1 HP )S m 
= Pm~ PlKa (I + HPH T ) _1 h m . 

The effective SNR of the £th user is 

Pe = _1_ 

^para (H, St) i _ p e hj (I + HPH t ) \f: 

= 1 + P t hj (i + Y, Pihihjy'he 

where the last step uses the Woodbury Matrix Identity 
(Lemma 1) with Mi = 1, M 2 = P^ 2 h^, M 3 = (I + 
Y^i^e PihihJ) , and M 4 = M{. Finally, the rate region 
(for direct decoding) is 

ftparaCH,!) = SM+ : 

Re < 7; log + PehJ 

which matches the rates attainable via i.i.d. Gaussian code¬ 
books and treating interference as noise (see, e.g., [74, Equa¬ 
tion 8.69]). As shown in [3] (for equal powers), the rate 
region for recovering all of the messages can be significantly 
expanded by integer-forcing decoding, i.e., taking the union 
over all rank-L matrices A in Theorem 1. See Section III-C 
for more details. 0 

As hinted in the example above, a receiver can recover 
all L transmitted messages if the coefficient matrix Q = 
[A] mod p has rank L over 7L p and, under mild technical 
conditions, we can simply check if the integer coefficient 
matrix A itself has rank L over R. More generally, to recover 
the TOth message w m , the coefficient matrix must satisfy 
6^ £ rowspan(Q) over Z p . It often more convenient to check 
whether £ rowspan(A) over R and the following lemma 
gives a sufficient condition on when this is allowable. 

Lemma 4: Consider the set of Lx L integer matrices whose 
entries’ magnitudes are upper bounded by a constant a max . 
Let Sj n denote the TOth row of the L x L identity matrix. If 
£ rowspan(A), then any rate tuple (i?i,..., Rl) in the 
computation rate region for A from Theorem 1 (or Theorem 2 
below) is also achievable for recovering message w m . □ 
Roughly speaking, for large enough ' 1 p, it can be shown 
that 6^ £ rowspan(A) over the reals implies 6 ^ £ 

rowspan ([A] modp) over Z p since the entries are integer¬ 
valued and bounded. The proof follows along the same lines 
as that of [9, Theorem 5] and is omitted due to space 
considerations. 

Remark 6: From Lemma 3, it suffices to check integer 
matrices A whose entries’ magnitudes are upper bounded by 
A max (l + PH T H) for Theorem 1. Since we have imposed an 
upper bound on the maximum singular value of the channel 

l0 Recall that the field size p can be chosen as large as desired according 
to Definition 4. 
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matrix H in Section II-B, we automatically obtain an upper 
bound on the entries of all viable A. Furthermore, for sce¬ 
narios where some probability of outage is permitted, it can 
be argued that it suffices for the probability density function 
of the largest singular value to have a vanishing tail. See [1, 
Remark 10] for more details. 0 

In certain scenarios, it may be useful to recover the integer- 
linear combination of the codewords (rather than the linear 
combination of the messages). For instance, two relays in a 
network may wish to simultaneously transmit a linear function 
of the codewords to benefit from a coherent gain [75]. Addi¬ 
tionally, in a single-hop network, it is often more convenient 
to work directly with the codewords and ignore the finite field 
perspective. 

Lemma 5: Under the nested lattice coding framework em¬ 
ployed for Theorem 1 (and Theorem 2 in the next subsection), 
if the linear combinations ui,..., vll with integer coefficient 
matrix A can be successfully recovered, then the integer- 
linear combinations of the codewords a^X,..., a^X can be 
successfully recovered as well. □ 

This follows directly from Lemma 11 in Section VI. As a 
quick example, consider the scenario from Example 1. If the 
receiver can recover the modulo sum of the messages vR>, 
then it can also recover the real sum of the codewords x/ : . 

Remark 7: Notice that, for a single-hop network. Lemma 5 
allows us to directly check recoverability conditions over R 
instead of Z p . For instance, consider an integer matrix A 
such that (i) Si £ rowspan(A) over R and (ii) Si ^ 
rowspan([A] modp) over Z p . The latter condition means 
that it is not possible to retrieve the mth message from 
the recovered linear combinations. However, using Lemma 5, 
we can first recover the integer-linear combinations of the 
codewords AX, solve for the mth codeword x m over R, and 
then infer the mth message w m from x m . (This is not always 
possible in a multi-hop network since the destination may 
not have access to the channel observations that were used 
to recover the linear combinations.) 0 


B. Successive Computation 

Consider a classical receiver that recovers individual code¬ 
words in a certain order. It is well-known that, once a 
codeword has been successfully decoded, it is beneficial to 
remove it from the channel observation so that subsequent 
decoding steps encounter less interference. Here, we explore 
an analogue of this successive interference cancellation (SIC) 
technique for compute-and-forward. 

Assume that the receiver has (correctly) decoded the linear 
combinations with integer coefficient vectors ai,...,a m _i. 
These linear combinations can be used as side information at 
the receiver for two different forms of successive cancellation. 
The first reduces the effective noise variance and the second 
reduces the number of users that need to tolerate this effective 
noise. 

Remark 8: Without loss of generality, we restrict ourselves 
to the decoding order 1,2,..../.. Any other decoding order 
can be mimicked by permuting the rows of the integer coef¬ 
ficient matrix A. 0 


We begin by showing how the side information can be 
employed to decrease the effective noise variance. From 
Lemma 5, we know that the receiver has access to the integer- 
linear combinations of the codewords a|X,..., a^ l _ 1 X, 
which we will write concisely as A m _iX where 


a m — 1. 

Operationally, the receiver forms the effective channel 

y m lX 

= a mX + (b^H + cJjA m _i — a ^)X + b^Z , 

'---' 

effective noise 

where b m £ M iVr and c m £ M m_1 are equalization vectors. 
We denote the effective noise variance for successive compu¬ 
tation by 

^succ a m ; t)m 5 | A m _i) 

4 ±e ||(bln + clA m _, - *1 )X + b^z|| 2 

= llbmll 2 + II (b^H + A ro _! - a T )pV2|| 2 . (12) 

Note that, so long as aj n is not orthogonal to rowspan(A m _i), 
we can select c m to obtain a strictly lower effective noise 
variance than possible via parallel computation (9). 

Lemma 6: Assume that rank(A m _i) = to— 1. (Otherwise, 
delete rows of A m _i until it has full rank and ignore the 
associated linear combinations in the decoding scheme.) The 
equalization vectors b m £ R^’ and c m £ that 

minimize the effective noise variance from (12) are the MMSE 
projection vectors 

= ( a ™ - c: pt , m A m _ 1 )PH T (l + HPH T ) 1 
cj pt , m = a^F T FAT_ 1 (A m _iF T FAT_ 1 ) _1 , 

where F is a matrix that satisfies (7). Let 

N ro _! 4l-FAT_ 1 (A m _ 1 F T FAT_ 1 )- 1 A ro _ 1 F T (13) 

denote the projection matrix for the nullspace of FA^ l _ 1 . The 
minimal effective noise variance is 


A i — 


^succC^-? a m| A m _i) 

^succ(^’ ^m 5 C 0 pt ?m | A777,-1 

= a^F T N m _iFa m 

= ||N m _iFa m II 


(14) 


□ 

The proof can be found in Appendix C. 

The second form of successive cancellation utilizes the alge¬ 
braic structure of the codebooks, and was originally proposed 
in [9]. Recall that, in the parallel computation scheme, every 
fine lattice that participates in a^X must be able to tolerate 
the associated effective noise. Clearly, if we knew some of the 
individual codewords, we could remove them from the channel 
observation and thus relax the noise tolerance constraints on 
their fine lattices. However, we only have access to certain 






IEEE TRANS INFO THEORY, TO APPEAR 


11 


linear combinations of the codewords, but it turns out that 
this is still enough side information to relax the noise tolerance 
constraints in a similar fashion. 

In Section VI, we provide a detailed description of this 
algebraic successive cancellation technique. At a high level, 
it can be viewed as performing Gaussian elimination over Z p 
where row swaps are not permitted. The following definition 
will be used to specify valid user cancellation orders. 

Definition 5 (Admissible Mappings): Let A denote an Lx 
L integer matrix and let I C {1,..., L} x {1..... L } denote 
a set of index pairs. We say that I is an admissible mapping 
for A if there exists a real-valued, lower unitriangular 11 matrix 
L £ such that the (m, £)\h entry of LA is equal to zero 

for all (m,£) X. Let M( A) denote the set of admissible 
mappings for A. 0 

Within the context of our scheme, if X is an admissible 
mapping for A and (to, £) £ X , then user £ will not be 
cancelled out during the mth decoding step. Therefore, the 
fth transmitter must be able to tolerate the mth effective noise. 
The following theorem makes this notion precise. 

Theorem 2: For an AWGN network with L transmitters, 
a receiver, and power constraints Pi,...,Pl, the following 
computation rate region is achievable, 

H,A)= U U rc succ (H,A,I) 

Aez LxL xeM^A) 

rowspan(A)Crowspan(A) 

where 


p succ (h,a,i )±{(r 1 ,...,r l )£ 


Re <- log + 


Pf 


°succ(^! a m| A TO —l) 


(15) 

V(m, t) £ X > 


and aj n and a m ,e are the mth row and (to, £)th entry of A, 
respectively. □ 

The proof is presented in Section VI. 

Corollary 1: For every channel matrix H and integer co¬ 
efficient matrix A, the parallel computation rate region is 
contained within the successive computation rate region, 
T^-comp*(H, A) C Pcomp^(H,A). Furthermore, there exist H 
and A for which the subset relation is strict. □ 

Example 3: Consider three transmitters with equal power 
constraints, P\ = P 2 = P 3 = P, and a receiver that observes 
Y = 2Xi + X 2 + X 3 + Z and wants to decode the linear com¬ 
binations with integer coefficient vectors ai = [1 1 1] T and 
a 2 = [1 — 1 — 1] T . In our notation, this corresponds to channel 
matrix H = [2 1 1] and integer matrix A = [ap a 2 0] T . The 
effective noise variances for successively decoding these linear 
combinations are 


^succ(-^-? *v) 
^succ(l-l’ a 2 | <3-1) 


3 P + 2 P 2 
1 + 6 P 

P(3 + 24P+ 18P 2 ) 
8 + 38P + 36P 2 


11 A unitriangular matrix is a triangular matrix whose diagonal entries are 
equal to 1. 


Following Definition 5, the mappings 

Xi = {(1,1), (1,2), (1,3), (2,2), (2,3)} 
X 2 = {(1,1), (1,2), (1,3), (2,1)} 


are admissible using the lower unitriangular matrices 



' 1 

0 

o' 


'1 

0 

o' 

t-l 

II 

-1 

1 

0 

l 2 = 

1 

1 

0 


0 

0 

1 


0 

0 

1 


respectively. This yields the rate regions 

Psucc(H, A, Pi) 


— \ (Pi, R 2 , R 3 ) £ 


Psucc(H, A, 2b) 


, 1 
: max Re < — log 
~ l " £=1,2,3 2 


Pi < - log + 


1 + 6 P 
3 + 2P 

8 + 38P + 36P 2 \ 
3 + 24P + 18P 2 ) 


= •] (Pp, P 2 , P 3 ) £ R+ : max Re <\ log + ( 
^ £=1,2,3 2 V 


1 + 6 P 
3 + 2P 


„ , 1, + £8 + 38P+36P 2 ll 
mrnRt < - log ( 3 + 24 p+ 18 p, j). 


which are part of the achievable computation rate region 
expressed in Theorem 2. Note that the linear combination with 
integer coefficient vector a 2 cannot be directly decoded using 
Theorem 1 since cr 2 ara (H, a 2 ) = 3P > P. 0 

Example 4: We return to the scenario of Example 2 wherein 
the receiver wants all of the messages, A = I. Clearly, the 
mapping X = {(1,1),..., (L, L)} is admissible using the 
lower unitriangular matrix L = I. Consider the mth decoding 
step. The receiver has successfully decoded the first to — 1 
messages corresponding to A m _i = [5p • • • <$ m _i] T where 
8 i is the /th column of I. By setting the /th entry of the 
equalization vector c m to be b^h, (and the rest to 0), the 
effective noise variance from (12) will be reduced to 

||b m || 2 + ||(bT [0 ... 0h m ••• h L ] - sDp^W 2 . 


Following the same steps as in Example 2, it can be shown 
that we can reach an effective SNR of 

1 + P f hj^i+ ^2 h t 

for the (th user. Thus, the rate region (for successive decoding) 
is 


P SUC c(H,I) — |(Pi,...,Pl) €R£ : 

Rt<\ log (l + Pt hi (l+ J2 Pihih ^) h t 

This is equal to the rate region attainable via i.i.d. Gaussian 
codebooks and SIC decoding [76, Theorem 1] under the 
lexicographic decoding order. The SIC rate region for any 
decoding order can be attained via Theorem 2 by setting A 
to be the corresponding permutation matrix. 
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As shown in [7] (for equal powers), the rate region can 
be enlarged via successive integer-forcing decoding, i.e., tak¬ 
ing the union over all full-rank matrices A in Theorem 2. 
Specifically, the successive integer-forcing rate region strictly 
contains the union of the SIC rate regions across all decoding 
orders. (Of course, when time-sharing across decoding orders 
is permitted then SIC can attain the entire capacity region.) 0 

The statement of the computation rate region in Theorem 2 
takes a union over all integer matrices whose rowspan contains 
that of A. We now argue that it suffices to take this union over 
a certain subset of these matrices. As a motivating example, 
consider a receiver that wishes to recover a single linear com¬ 
bination with an integer coefficient vector a = [a\ ... a^] 7 
whose entries have a greatest common divisor larger than one, 
gcd(ai,..., ai) = a > 1. If the receiver attempts to decode 
the corresponding integer-linear combination directly, it will 
encounter an effective noise variance of <7^(11, a) = 11Fa11 2 
according to (11). However, the receiver can instead decode 
the linear combination with integer coefficient vector a = 
a -1 a, which will yield a lower effective noise variance of 
<7p am (H,a) = ||Fa|| 2 = cr —2 11Fa11 2 . Afterwards, it can scale 
by a to recover its desired linear combination. 

We now show to generalize this concept to decoding more 
than one linear combination. We will need the following 
definitions. 

Definition 6 (Unimodular Matrix): A square integer matrix 
is unimodular if its determinant equal to +1 or —1. 0 

It can be shown that the inverse of a unimodular matrix is 
itself unimodular. 

Definition 7 (Primitive Basis Matrix): An integer matrix 
A £ Z ixi whose rank is equal to M < L is said to be 
a primitive basis matrix 1 if it is of the form 

A = [ Am 

0 (L-M)xL. 

where A m is a full-rank, M x L integer matrix, and there 
exists an integer matrix B £ jf L ~ M ) xL such that the matrix 

A m 
B 

is unimodular. In other words, A can be completed to a 
unimodular matrix. 0 

The following theorem establishes that the computation rate 
region from Theorem 2 is unchanged if we only take the union 
over primitive basis matrices. 

Theorem 3: Let 

7^™>(H,A) = U 1J 7WH,A,I) 

_ Aez I ' x - L lex (A) 

A primitive basis 
rowspan(A)Crowspan(A) 

where 7?. SUCC (H, A,I) is defined in (15). This region is equal 
to the computation rate region from Theorem 2, 

^)(H,A) = ^)(H,A) . 

12 Our choice of terminology is inspired by the fact that a matrix of this 
form can be viewed as the basis of a primitive sublattice (i.e., a sublattice 
that is formed by intersecting an L-dimensional lattice with a subspace of 
dimension M < L) and that any L-dimensional lattice basis is unimodular. 
See [77, Section 1.2] for more details. 


□ 

The proof is deferred to Appendix D. 

Corollary 2: If the integer coefficient matrix A £ Z L x L 
has rank L, it suffices to take the union in Theorem 2 over 
the set of all unimodular matrices, rather than the set of rank 
L integer matrices. □ 

C. Computation for Multiple-Access 

At a first glance, it may seem that the lattice-based compute- 
and-forward framework developed above is a poor fit for 
multiple-access communication. Specifically, consider a re¬ 
ceiver that observes the sum of the transmitted signals. From 
Example 1, the receiver can decode the sum of the codewords 
at a high rate, but this alone is insufficient to discern the indi¬ 
vidual messages. On the other hand, if the transmitters employ 
(independent) random i.i.d. codebooks, then all message tuples 
will be mapped to different sums with high probability. 

However, within the compute-and-forward framework, a re¬ 
ceiver is not restricted to decoding a single linear combination. 
In fact, a natural multiple-access strategy is to decode L 
linearly independent linear combinations and then solve for 
the underlying messages. This approach was first proposed 
in [9] for the parallel computation strategy with equal powers, 
and it was demonstrated that, when combined with algebraic 
successive cancellation, it always operates within a constant 
gap of the multiple-access sum capacity. Subsequent work [7] 
showed that, under certain technical conditions, the successive 
computation strategy with equal powers can operate at the 
exact multiple-access sum capacity. Below, we extend these 
results to the unequal power setting. 

Recall that the multiple-access capacity region for our 
channel model is 

7£mac(H) = | (Ri, Rl) £ R+ : (16) 

< ilogdet(l + H 5 P 5 H 7 ) V<S C {1,..., L}\ 

i&S ' 

where Hs refers to the submatrix consisting of the columns 
of H with indices in S and P 5 to the submatrix consisting of 
the entries of P whose row and column indices are in S. The 
multiple-access sum capacity is simply i logdet(l-|-HPH T ). 
Any rate tuple in the capacity region is achievable using 
i.i.d. Gaussian codebooks at the transmitters combined with 
joint typicality decoding at the receiver [78, Section 15.3.1], 

In order to operate near the multiple-access sum capacity, 
we will need to uniquely map users to effective noise variances 
via the following definition. 

Definition 8 (Admissible Multiple-Access Mappings): Let 
I C {1 x {1 be an admissible mapping 

for integer matrix A £ Z LxL according to Definition 5 
and let L £ R ixi be the associated lower uni triangular 
matrix. We say that X allows a multiple-access permutation 
7 r : {1, 2,..., L} —» {1, 2,..., L} if LA is upper triangular 
after column permutation by 7 r. 0 

Intuitively, if X allows a multiple-access permutation 7r, 
the scheme will remove the (7r _1 (m))th codeword from its 
effective channel after the mth decoding step. This means 
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that the (7r -1 (TO))th codeword only needs to tolerate the first 
to effective noise variances. The details of this scheme are 
presented in Section VI. 

There is always at least one admissible multiple-access 
mapping for every full-rank matrix A £ Z LxL . For instance, 
one can apply LU factorization with column permutation 
only to find an appropriate L, admissible mapping I, and 
permutation Also, recall from Remark 7 that if the receiver 
can recover L linear combinations with a full-rank integer 
coefficient matrix A £ Z ixi , then it can solve for the original 
messages. 

Parallel Computation for Multiple Access: First, we note 
that, in the parallel computation strategy from Theorem 1, each 
user must overcome the effective noise for all linear combina¬ 
tions in which it participates. A simple multiple-access strategy 
is to recover a rank-L set of linear combinations that are then 
solved for the original messages. This corresponds to setting 
A = I in Theorem 1, leading to the following achievable rate 
region for multiple access: 


7fc)( H ,I) = 


'-'comp 


(Ri 1 ..., Rl) £ 


l L + 


Ri < max min — log + 
Ae1 LxL 0 2 

rank(A)=L 


Pi 

cr para(H, a m) 



This can be viewed as a generalization of the integer-forcing 
achievable rate from [3, Theorem 3] to unequal powers. As 
shown in [3], this strategy has a good ensemble average (e.g., 
for Rayleigh fading), but there are specific choices of H for 
which the sum rate lies arbitrarily far from the sum capacity. 

As argued in [9], by augmenting parallel computation with 
algebraic successive cancellation, we can operate within a 
constant gap of the sum capacity. Here, we extend this result 
to the unequal power setting. This corresponds to applying 
Theorem 2 and setting the equalization vectors for the side 
information to zero, c m = 0, so that (15) is replaced with 


ft ASC (H,A,2) = \ (R 1 ,...,R l ) £ 


M- ’ 


Re < ^ lo g 


Pe 


CT para(H, a m) 


for all (m, £) £ X > . 


The next definition and lemma are taken from [14, Section 
VIII] and will be used to select appropriate integer vectors for 
parallel computation. 

Definition 9 (Dominant Solutions): Let F be any matrix 
satisfying (7). A set of linearly independent integer vectors 
a*,...,a£ £ Z L satisfying ||Fa*|| < < ||Fa|J| is called 

a dominant solution if, for any linearly independent integer 
vectors ai,...,a£ £ Z L satisfying ||Fa.i|| < ••• < ||Fa£,||, 
we have that ||FaJ[J| < ||Fa m |j for m = 1 We will 

call an integer matrix A* £ Z ixL a dominant solution if its 
rows (aj) T ,..., (a^) T correspond to a dominant solution. 0 
Lemma 7 ( [14, Theorem 8]): For any F satisfying (7), 
there always exists a dominant solution a], a^,..., a* L £ Z L 
satisfying 


a} = argmin { ||Fa|| : a £ Z L \ {0}} 


a% = argmin { ||Fa|| : a £ Z L , a, aj linearly independent} 


a L = argmin(||Fa|| : a £ Z L , a,a},... ,a? L _ x 

linearly independent} . 


□ 

See [14] for a proof as well as a greedy algorithm. 

We can now show that the parallel computation strategy, 
when combined with algebraic successive cancellation, is 
approximately optimal for multiple access. 

Theorem 4: For any dominant solution A* £ Z ixi and 
admissible multiple-access mapping T with permutation n, the 
rate tuple 



para 






is achievable via the parallel computation strategy com¬ 
bined with algebraic successive cancellation, (R\,... ,Rl) £ 
7?-asc(H, A*,I). The sum of these rates is within a constant 
gap of the multiple-access sum capacity. 


Y. R i>\ l°g det (I + HPH t ) - | log(L) . 


□ 

The proof can be found in Appendix E. 

Successive Computation for Multiple Access: It is well- 
known that the corner points of the Gaussian multiple-access 
capacity region can be attained using i.i.d. Gaussian codebooks 
at the transmitters along with SIC decoding [76, Theorem 
1], As we will argue below, successive computation enjoys 
a similar optimality property: there is always at least one 
integer matrix for which the successive computation rate 
region includes a rate tuple that attains the multiple-access 
sum capacity. For instance, as shown in Example 4, successive 
computation decoding can mimic SIC decoding and attain 
the corner points of the capacity region. Furthermore, the 
successive computation rate region often includes non-corner 
points that attain the sum capacity (i.e., rate tuples that lie 
on the interior of the dominant face of the multiple-access 
capacity region). These points are not directly accessible via 
SIC decoding (but can be attained by enhancing SIC with 
time-sharing [79, Section 4.4] or rate-splitting [80]). 

Our optimality results stem from the following identity. 

Lemma 8 : For any unimodular matrix A £ Z ixL and 
permutation 7r, we have that 


^ 2 8 \ °fucc (H. 1^A^OO-i) / 

= ilogdet(l + HPH T ) . (17) 


□ 

See Appendix F for a proof. 

At a first glance, it may appear that Lemma 8 implies that 
every unimodular matrix yields sum-rate optimal performance 
for successive computation. However, this lemma does not 
guarantee that the rates appearing in (17) are achievable. 
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Specifically from (15), for an admissible mapping T that 
allows multiple-access permutation n, decoding is possible if 
the fth user can tolerate effective noise of variance 

max ofuc C (H, a m |A m—1 ) • 

In order to apply Lemma 8 and show that Theorem 2 attains 
the exact sum capacity, this maximum must be equal to 
CT Lc( H > a ^)l A ^)-i) for i Moreover, the 

achievable rate expression in (15) are written in terms of the 
log + function whereas the summands in (17) simply use the 
log function. Thus, each user’s power should exceed its as¬ 
sociated effective noise variance. Putting these two conditions 
together and applying Lemma 8, we arrive at the following 
optimality result for successive computation. 

Theorem 5: Let A be an L x L unimodular matrix and 
let I be an admissible mapping that allows multiple-access 
permutation 7r. If, for l = 1,..., L, we have that 

max <t succ (H, a m | A m _i) = cj succ ^H, | 

(18) 

and Pg > of ucc (H, a.n(e) \ then the rate tuple 

Re = I log ( 2 —~~\~Z - \) t=h---,L 

^ \ ^succ (-*A a 7r(f) | -^-7r(t?) — 1 ) J 

is achievable via the successive computation strategy, 
(Ri ,..., Rl ) G 7?. SUC c(H, A, I). Moreover, the sum of these 
rates is equal to the multiple-access sum capacity, 

L 1 

J2 R e = xlogdet(l + HPH T ) . 

e=i 

□ 

Remark 9: It is sometimes convenient to replace the con¬ 
dition in (18) with the stricter condition that 

<4cc(H,ai) < of ucc (H,a 2 |Ai) < • ■ • < cr s 2 ucc (H,a L |A L _i) . 

0 

Remark 10: For any channel matrix H and power ma¬ 
trix P, there always exists a unimodular matrix A and 
multiple-access mapping I with permutation tt for which 
Theorem 5 applies. For instance, as shown in Example 4, 
we can attain the performance for SIC decoding with order 
7 r by settting A to be the associated permutation matrix and 
T = {(1,7r(l)),..., (L, 7r(L))}. The next example demon¬ 
strates that the successive computation strategy can attain sum- 
capacity rate tuples that do not correspond to SIC corner 
points. 0 

Two-User Example: Consider a two-user multiple-access 
channel (i.e., L = 2) with channel matrix H = [hi h 2 ], For 
H = [1 |], Pi = 7, and P 2 = 4, we have plotted, in Figure 4, 
the capacity region (16) and marked the rate tuples achievable 
via SIC decoding, parallel computation for multiple access 
from Theorem 4, and successive computation for multiple 
access from Theorem 5. Specifically, SIC decoding yields the 
rates pairs (0.3828,1.6610) and (1.5000,0.5437). Successive 
computation can attain both of these rate pairs as well as 
(1.0850,0.9588) and (1.3624,0.6813). Finally, there are two 



Fig. 4. Plot of the Gaussian multiple-access capacity region as well as the 
dominant rate pairs for parallel computation, sum-capacity optimal rate pairs 
for successive computation, and SIC comer points for the channel matrix 
H = [1 |] and powers P± = 7 and P 2 = 4. 

dominant rate pairs for parallel computation: (1.3624,0.5903) 
and (0.9940,0.9588). Note that all successive computation 
rate pairs are sum-capacity optimal and that the parallel 
computation rate pairs are much closer to the sum-capacity 
than the (worst-case) bound of 2 bits. 13 () 

Remark 11: In independent and concurrent work, Zhu and 
Gastpar have developed a compute-and-forward approach to 
multiple-access [62], Their main idea is to use the same 
fine lattice at each transmitter along with different coarse 
lattices. The second moments of the coarse lattices are set 
according to the desired rates, and each transmitter scales 
its lattice codeword to meet its power constraint. (Note that 
their approach does not establish a correspondence to a finite 
field.) Overall, they establish that a significant fraction of the 
multiple-access sum-capacity boundary is achievable via this 
approach. We note that the underlying compute-and-forward 
result for unequal powers [62, Theorem 1] is a special case of 
Theorem 1. 0 

D. Multiple Receivers 

So far, we have limited our discussion and results to single¬ 
receiver scenarios. Although this has allowed us to introduce 
our main ideas in a compact fashion, the compute-and-forward 
framework is the most useful in scenarios where there are 
multiple receivers that observe interfering codewords. We 
now expand our problem statement to permit I\ receivers, 
potentially with different demands. (See Figure 5 for a block 
diagram.) As we will see, the parallel and successive compu- 

''MATI.AB code to generate this plot for any (two-user) choice of H and 
P is available on the first author’s website. 
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tation rate regions can be expressed in terms as intersections 
of the corresponding rate regions for each receiver. 



u ™ = © 9m,G [w f ]| 


e=i 

Fig. 5. Block diagram for the compute-ami-loruard problem with multiple 
receivers. Each transmitter has a message w; whose elements are taken from 
Tip. This message is embedded into T^ (by zero-padding), mapped into a 
codeword x/ £f", and sent over the channel. Each receiver observes a noisy 
linear combination of these codewords, = H fxi X 2 ■ • ■ x/_] T I ZhI, 

til 

and attempts to recover the linear combinations u; J ,..., iq of the coset 
representatives of the original messages. 

As in Section II-B, for i = 1 the ith receiver 

is equipped with antennas and observes a noisy linear 
combination of the channel inputs, 


a set-valued function 7^ com p (HW,..., H^l, AW,..., A^l) 
that maps I\ -tuples of channel matrices (H^,..., H^) G 
jjJvWxl x x xL and K -tuples of integer coefficient 

matrices (AW,..., A^'l) G Z LxL x ■ ■ • x Z LxL to a subset 
of The computation rate region described by 7£ com p is 
achievable if, for every rate tuple {R \ 1 f? 2 , • ■ •, Rl) G Ri, 
e > 0 , and n large enough, there exist 


• parameters p. kc.t, k-pj_ satisfying 
Re — e for i = 1,2,..., L, 

• encoders £i,...,£ l, 


ky : e — kc,e 
n 


log p 


> 


such that, 

• for all I\ -tuples of channel matrices (HW,..., H^ A 1) G 
RAf'xi x ... x R Nl r K] * L and 

• every A-tuple of integer matrices (AW,..., A^l) 

satisfying (Ri ,..., R L ) G 

'^•comp(H[ 1 l, . . . , H^l, AW, . . . , A[ A 1) 

there exist decoders T>i,, V K with probability of decoding 
error at most e. 0 

We can now state the achievable computation rate regions 
for parallel computation and successive computation. 

Theorem 6: For an AWGN network with L transmitters sat¬ 
isfying power constraints Pi. If,..., Pl , respectively, and K 
receivers, the following computation rate region is achievable. 


Y [i l = + z[i] 

e=i 

where hj^ G 1 is the channel vector between the £th 
transmitter and the zth receiver and Z is elementwise 
i.i.d. Af(0, 1). We group the channel vectors corresponding 
to the ith receiver into a channel matrix 


AW...,Aro) 

K 

= f|^mp ) (H [i] ,A [il ) 

i= 1 

for the function ^comp'* (H^, A^) as defined in Theorem 1. 

□ 


rW 4 




The problem statement is essentially the same as in Sec¬ 
tion II-D. Following Definition 3, we define a decoder : 
Rx M. N '' ]xL x Z LxL —> Zp Xk for the ith receiver 
that takes as inputs the channel observation Y^, the channel 
matrix H', and the desired integer coefficient matrix A '”, and 
outputs an estimate TjW = pN(Y^, H^, A^l). Let (uS) T 
denote the mth row of U^. We say that decoding is successful 
if uj^ = ,..., 11 ^ = 11 ^; for some linear combinations of 

the form 

L 


The proof can be inferred from that of Theorem 1 : the encoders 
implement the same scheme as in the single-receiver case and 
each receiver implements the parallel computation decoder 
described in Section V. 

Theorem 7: For an AWGN network with L transmitters sat¬ 
isfying power constraints P\ .If,, P^, respectively, and K 
receivers, the following computation rate region is achievable, 

^ } (H [11 . H^.A! 1 !,...^^ 

K 

= n^S ) (H«,AW) 


U m = ® 

t =1 

where the ( G Z p are the entries of Q^'- = [A^] modp 
and Wf G |w/ |. An error occurs if decoding is not successful 
for some i G {1,..., K}. 

Remark 12: Note that we insist that the same coset repre¬ 
sentatives w/ are used across receivers. This is to ensure that 
the linear combinations from multiple receivers can later be 
put together to recover the original messages. <) 

We now adjust our definition of a computation rate region 
to accommodate multiple receivers. 

Definition 10 (Multiple-Receiver Computation Rate Region): 
A computation rate region for K receivers is specified by 


i=l 

for T^comp)(HM, AW) as defined in Theorem 2. □ 

Again, the proof can be inferred from that of Theorem 2: the 
encoders implement the same scheme as in the single-receiver 
case and each receiver implements the successive computation 
decoder described in Section VI. 

Unfortunately, the multiple-access sum-capacity optimality 
results do not transfer directly from the single-receiver setting. 
Nonetheless, in the multiple-receiver setting, computation for 
multiple-access outperforms i.i.d. Gaussian encoding with SIC 
decoding even when time-sharing is allowed. We explore this 
phenomenon in the context of a compound multiple-access 
channel below. 
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E. Case Study: Two-User Gaussian Compound Multiple- 
Access Channel 

We now take an in-depth look at the performance of SIC 
and compute-and-forward for a two-user Gaussian compound 
multiple-access channel. In our notation, this corresponds to 
L = 2 transmitters with messages Wj and W 2 , respectively, 
and I\ = 2 receivers that both want to recover wj and W 2 . 
The capacity region is the intersection of the individual MAC 
capacity regions, 

= 7e M Ac(HW) n7e M Ac(H[ 2 l) 

and can be achieved via i.i.d. Gaussian coding and joint 
typicality decoding. As discussed below in Remark 14, the 
compound MAC often appears in the context of A'-user inter¬ 
ference channels. In some scenarios, the transmitters may opt 
to induce interference alignment by using lattice codebooks 
instead of i.i.d. Gaussian codebooks. This motivates the need 
for lattice-based decoding strategies. 

In order for the ith receiver to successfully recover both 
messages with SIC decoding, the rates must fall within the SIC 
decoding region for the corresponding (two-user) multiple- 
access channel, 

7ffi(H [i] ) =ftsic,a(HM) UTZsk^hN) 

ftsic > a(H[‘1) = {(R 1 ,R 2 )€M 2 + : 

Ri < ^log(l + ||h' <] || 2 Pi), 

R 2 < ilog(l + P2(4 i] ) T (l + P 1 h' il (h[ i] ) T )" 1 h[ i1 )} 

^sic, 6 (H [iI ) = {(Ri,R 2 ) : 

Pi < ilog(l + Pi(h[ fl ) T (l + i^h^(hf) T )" 1 hi i] ) J 

P 2 < ilog(l + ||ht l] || 2 P 2 )} . 

Thus, for the compound multiple-access channel, SIC decod¬ 
ing combined with time-sharing can attain 

P-cmac( H[1 ] ,H [2] ) = conv(<S(HW) nC(H [2] )) 

where conv refers to the convex hull operation. Note that 
SIC decoding does not, in general, attain the sum-capacity 
even if aided by time-sharing. This is due to the fact that the 
corner points of the two multiple-access capacity regions do 
not coincide nor do the time-sharing ratios required to reach 
any other sum-capacity points. 

Successive computation does not reach the sum capacity 
for similar reasons. Namely, the sum-capacity rate pairs that 
can be directly attained with successive computation (see 
Theorem 5) will differ across receivers as will the required 
time-sharing ratios for other sum-capacity points. Using the 
achievable computation rate region from Theorem 7 combined 
with time-sharing and setting AW = A^ 2 ! = I, we get that 

P^ac(hW, HP!) 

= conv(^ m f(HW,l) n 7^>(H [2] ,I)) 


is achievable for the compound multiple-access channel. The 
flexibility to optimize over full-rank integer matrices A yields 
a larger rate region than SIC, as shown below. 

Example 5: In Figure 6 , we have illustrated how these 
rate regions are calculated for a compound multiple-access 
channel with H™ = [3.3 2.1], = [2.4 4.2], Pi = 4, 

and P 2 = 3. The corner points of the multiple-access ca¬ 
pacity region with respect to receiver 1 are (1.0109,1.9154) 
and (2.7388,0.1875). Successive computation achieves these 
corner points as well as the rate pairs (1.5084,1.4180) and 
(1.6255,1.3008). With respect to receiver 2, the corner points 
are (0.2566,2.8764) and (2.2937,0.8393) and successive 
computation achieves these corner points as well as the rate 
pairs (1.3799,1.7531) and (1.9606,1.1724). As expected, 
after intersecting the individual multiple-access regions and 
time-sharing, neither SIC nor successive computation reach 
the sum capacity. 

Remark 13: In recent work, Wang et al. [81] have demon¬ 
strated that, for this scenario, a variation of SIC that encodes 
messages across multiple blocks and employs sliding-window 
decoding can match the performance of joint typicality decod¬ 
ing. 0 

Remark 14: The compound multiple-access channel is an 
important building block for understanding the capacity re¬ 
gion of multi-user interference networks. For instance, in the 
strong interference regime, the capacity region of a two-user 
interference channel corresponds exactly to that of a two-user 
compound multiple-access channel [82], [83]. In this two-user 
setting, successive computation is inferior to i.i.d. random cod¬ 
ing with joint typicality decoding. However, in an interference 
channel with three or more users, there is the possibility of 
interference alignment [84], [85], For instance, in the A'-user 
symmetric Gaussian interference channel, the sum capacity in 
the strong interference regime can be approximated by that 
of a two-user symmetric Gaussian compound multiple-access 
channel [9]. To induce alignment at the receivers, the scheme 
from [9] employs the same lattice codebook at each transmitter 
and has each receiver decode its desired message indirectly, 
by first recovering two independent linear combinations via 
compute-and-forward. In recent work [10], we have shown that 
lattice interference alignment is possible in any setting where 
the beamforming vectors are aligned “stream-by-stream” so 
long as the codewords are allowed to have unequal powers. 

IV. Nested Lattice Codes 

In this section, we describe the nested lattice codes that will 
be the building blocks of our encoding and decoding schemes. 
We begin with some basic lattice definitions in Section IV-A, 
present nested lattice constructions and properties in Section 
IV-B, and discuss mappings to and from Z]’j in Section IV-C. 

A. Lattice Definitions 

We now review some properties of lattices that will be 
useful for our code constructions and refer interested readers 
to the textbook of Zamir [60] for a comprehensive treatment of 
lattices for coding. A lattice A is a discrete additive subgroup 
of M" that is closed under addition and reflection, i.e., for any 
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R2 


Intersection of Receiver 1 and 
Receiver 2 Rate Constraints 


2 


^MAc(HW)n^MAc(Hl 2 ]) 


R.2 


3 - - 


2 - - 



Fig. 6. Step-by-step illustration for determining the capacity region, successive computation rate region, and SIC rate region for a compound multiple-access 
channel with two transmitters with powers Pi = 4 and P 2 = 3, respectively, and two receivers with channel matrices = [3.3 2.1] and = [2.4 4.2], 
respectively. The top left depicts the rate constraints that must be satisfied for the first receiver to successfully decode both messages and the bottom left 
depicts the corresponding rate constraints for the second receiver. The top right shows the intersection of these rate constraints and the bottom right shows the 
achievable rate regions for the compound multiple-access channel that comes from convexification of the intersected regions, which corresponds operationally 
to time-sharing. Note that no convexification is needed for the capacity region 7 ^mac(HW ) nT^MAc(H^) since it is the intersection of two convex rate 
regions. 


Ai, A2 G A, we have that — Ai, — A2 G A and Ai + A2 G A. 
Note that this implies that the zero vector 0 is always an 
element of the lattice. 

Let 

Qa(x) a argmin ||x - A|| 

AeA 

denote the nearest neighbor quantizer for A. Using this, we 
define the (fundamental) Voronoi region V of A to be the set 
of all points in E" which are quantized to the zero vector 
(breaking ties in a systematic fashion). We also define the 
modulo operation , which outputs the error from quantizing x 
onto A, as 

[x] mod A = x — Qa(x) . 

The modulo operation satisfies a distributive law , 

[a[x] mod A + 6[y] mod A] mod A = [ax + by] mod A , 

for any integers a,b £ Z. The second moment of a lattice, 
denoted as cr 2 (A), is the second moment per dimension of the 


norm of a random vector that is drawn uniformly over the 
fundamental Voronoi region V, that is. 


0-2 (A) ^ 


Vol(V) 


dx , 


where Vol(V) denotes the volume of V. 

The following lemma will be useful in characterizing the 
distributions of dithered lattice codewords. 

Lemma 9 (Crypto Lemma): Let A be a lattice, x a random 
vector with an arbitrary distribution over 1", and d a random 
vector that is independent of x and uniform over V. It follows 
that the random vector [x + d] mod A is independent of x and 
uniform over V. □ 

See [60, Ch. 4.1] for a detailed discussion and proof. 

We say that lattice Ac is nested in lattice Ap if Ac C Ap. 
The lattice Ap is often referred to as the fine lattice and Ac as 
the coarse lattice. The coarse lattice induces a partition of the 
fine lattice into cosets of the form A +Ac for A £ A F . The set 
of all such cosets is written as Ap/Ac = {A +Ac : A £ Ap}. 
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(The notation Ap/Ac refers to the fact that this is a quotient 
group.) It will often be useful to represent each coset by a 
single element, i.e., a coset representative. We will represent 
each coset by its minimum norm element to obtain a set of 
coset representatives, 

[Ap/Ac] — Ap mod Ac ■ 


A nested lattice codebook C is generated using a nested 
lattice pair Ac C Ap. The codebook C comprises all elements 
of the fine lattice that fall within the Voronoi region Vc of the 
coarse lattice, C = Ap (~l Vc- The rate of the codebook is 

1 . |,| 1 . A'olTr) \ 

-loglCb-log^-p-yj 

where Vf is the Voronoi region of Ap. It can be shown 
that the set Ap n Vc is equal to the set of minimum-norm 
coset representatives [Ap/Ac]. This implies that the codebook 
C can be interpreted algebraically (i.e., as the set of coset 
representatives [Af/Ac]) and geometrically (i.e., as the set of 
all elements of the fine lattice Ap that are in the Voronoi region 
of the coarse lattice Ap). Loosely speaking, the geometric 
properties of our lattice constructions will be useful in ensuring 
that the codewords are spaced sufficiently far apart in K™ and 
that the power constraints are satisfied (Theorem 8). Similarly, 
the algebraic interpretation will be useful for constructing 
a linear mapping between our nested lattice codebooks and 
Zp, the vector space from which the messages are drawn 
(Theorem 9). 

Finally, note that any nested lattice pair Ac C Ap satisfies 
the following quantization property: 

[<9a f (x)] mod A c = Qa f ([x] mod A c ) mod A c . (19) 


B. Nested Lattice Constructions 

We will employ the nested lattice construction of Ordentlich 
and Erez [59] as part of our achievability scheme. We will 
design nested lattice codebooks using the same parameters 
in our problem statement, n,p, kc,i, kpt, for t = 1 
Consider the prime p and the corresponding finite field 4 Z p . 
The key idea is to take a series of nested linear codes of 
length n over Z p and then lift the codes from Z” to 1" using 
Construction A to obtain a series of nested lattices. 

We now specify parameters 15 for which sequences of good 
nested lattices exist. Let P max = max^ Pi and V n be the 
volume of an n-dimensional ball of radius 1. Following the 
construction in [59], for a given blocklength n, powers Pi, 
and noise tolerances cr 2 fff , we will set p to be the largest 
prime between \n 3 / 2 and n 3 / 2 , which is guaranteed to exist 
for n > 1 by Bertrand’s Postulate 1 ' 1 , and 

7 = 2\J nP max 2“ 

14 The field Z p is considered here rather than a generic prime-sized finite 
field since we will use the natural mapping from Z p to Z to lift our codes 
from the finite field to reals. 

15 These parameter choices are made to simplify the existence proofs. For 
instance, the prime p is chosen to grow with n so that the channel input 
distributions will look nearly Gaussian. In practice, one could take p to be 
relatively small and accept the rate loss associated with p-ary inputs to a 
Gaussian channel. 

16 For any m > 3, Bertrand’s Postulate, as proven by Chebyshev [86], states 
that there exists a prime between m and 2m. 
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*«= 2 l^( l0g (^) +1 ° g (^) + “) <20) 
^ = 5T^( 1<>g (^) +log (^) + “) <2 " 

where a > 0 will be chosen as part of Theorem 8 . 

Recall that fop = max^ kpj; and consider a kv x n matrix 
G over Z p . Let Gc,f denote the matrix consisting of the first 
kc,e rows of G and let Gp ^ denote the matrix consisting of 
the first kp t i rows of G for £ = 1 ,L. Define Cq,i and Cp^ 
to be the vector spaces generated by taking the columns of 
Gj, and Gp^ as a basis, respectively: 

C c .e = {g£*w : w G Zp c ’ g } 

C Fl e = {Gj £ w : w G Zp^} . 

Note that Cc,i- Cf.g £ = can also be viewed as an 

ensemble of nested linear codes. 

Define the mapping </ : Z p —s- R as 

4 >{w) = 7 p~ l w 

along with the inverse map 

4 >{k) = [ 7 ~ 1 pn\ modp , 

which is only defined over the domain 7 p - 1 Z. When applied 
to vectors, these mappings operate elementwise. 

Following (a scaled version of) Construction A, we create 
the lattices 

Ac,* = {* G tp^Z" : 0(A) € Cc,*} 

Ap,* = {a G 7 p _ 1 Z" : 0(A) G C Fj £ j 

Note that, by construction, A G Ac,^ (or App) if and only 
if 0(A) G Cc,e (or Cp^). We will refer to 0(A) as the 
corresponding linear codeword of A. 

We denote the Voronoi regions of Ac t t and Apy by Vc j and 
Vf.p, respectively. All 2 L lattices in this ensemble are nested 
with respect to the permutation that places the parameters kc,e 
and kp t e in increasing order (i.e., they form a nested lattice 
chain). In particular, since kcj < kv/, the nested lattice 
codebook Ct can be constructed using Ac 7 as the coarse 
lattice and Ap ^ as the fine lattice, 

Ci = Apn Vc,^. 

The theorem below restates key existence results from [59] 
in a form that is convenient for our achievability proofs. 
At a high level, the theorem guarantees that there exists 
a generator matrix G, such that, for t = 1,...,L, the 
submatrices Gc,^, Gp /. are full rank and that each resulting 
nested lattice codebook ,C( satisfies its power constraint Pi , 
tolerates effective noise with variance up to a/ lf f < Pi, and 
has rate close to -7 log(Pf/cr/ ff e ). 

Theorem 8 ( [59, Theorem 2]): Consider any choices of 
powers Pi > 0 and effective noise tolerances 0 < a/ rf f < Pi 

^Construction A was originally proposed in [87] as the vectors of the 
integer lattice Z n whose modulo-p reduction are elements of the linear 
codebook C C Zp, i.e., {A G Z n : [A] mod pGC}. 
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for £ = 1For any e > 0 and n large enough, there 
is a constant a > 0 such that for the choices of kc,p, kp t p 
in (20)-(21), there exists a matrix G G Zp FXn , such that, for 

e = i,...,L, 

(a) the submatrices Gc/, G F / are full rank. 

(b) the coarse lattices Ac/ have second moments close to the 
power constraint 

Pp — e < o' 2 (Ac/) < Pi ■ 


(c) the lattices tolerate the prescribed level of effective 
noise. Specifically, consider any linear mixture of Gaus¬ 
sian and Voronoi-shaped noise of the form z e ff = 
A)Zo + J2e=i Pe z e where /3 0 , ft,..., /3 L G R, z 0 ~ 
A/"(0,I), zp ~ Unif(Vc/), and the noise components 
zo, zi,..., zx, are independent of each other and A. Then, 
for any fine lattice point A G Ap, m , 

P(<2A F ,m + z eff) 0 e 

if 0o + Y0p =1 010 < CTeff ,rn- similarly, for any coarse 
lattice point A G Ac, m , 


P^QA c , m (-^ + z eff) 7^ ^ e 

if/3 0 2 +Eti^<^- 

(d) the nested lattice codebooks Cp = Ap/ H Vc/ have 
appropriate rates 


1 , i i fcp r — kr p , 1 , 

- log = —-- logp > - log 

n n 2 


Pi 

t 2 

7 eft / 


— e 


□ 

Note that (a) is established in the proof of [59, Theorem 1], 
which is then used to establish [59, Theorem 2] as a corollary. 

Remark 15: In Theorem 8, we have only stated lattice prop¬ 
erties that are essential for our achievability proofs. In many 
cases, it can be shown that nested lattices satisfying stronger 
versions of these properties exist, which in turn could be used 
to relax some of the assumptions in our problem statement. 
For instance, one can select lattices that are tuned for non- 
Gaussian channel noise. See [58]—[60] for more details. 0 


Let £ m j n = argmin^fcc/ and £ max = argmax £ fc F /. It will 
be convenient to define Ac = Ac p ■ and At = hyp as 
the coarsest and finest lattices in the ensemble, respectively. It 
follows that Ac C Ac/ C Ap/ C A P for £ = 1,..., L. Recall 
that kc — min/ fcc/, kp = max/ fep/, and k = kp — kc- 
Definition 11: A mapping ip : Ap —>• is called a linear 

labeling if it satisfies the following two properties: 

(a) A lattice point A belongs to Ap/ if and only if the last 
kp — kpj components of its label A) are equal to 0. 
Similarly, a lattice point A belongs to Ac/ if and only if 
the last kp — A// components of its label <p(A) are equal 
to 0. 

(b) For all ap G Z and Ap G Ap, we have that 

ipl^apApj = (£)qp ip(Ap) 

l ' i= l 


where qp = [ap] mod p. 


0 

Our proposed linear labeling stems directly from the fact 
that, for any A G Ap, the corresponding linear codeword can 
be expressed as </>(A) = G T v for some vector v G Z^L 
Assuming that G is full rank, then this vector is unique. 

Theorem 9: Assume that G is full rank. Let p : Ap Z)' 
be the function that maps each A G Ap to the vector <p(A) 
that consists of the last k components of the unique vector v 
satisfying <^(A) = G T v. Then, p is a linear labeling. □ 
See Appendix G for a proof. We have depicted Theorem 9 
through signal levels in Figure 7. 

It will also be useful to have an explicit inverse (p : Z£ A f 
for the linear labeling tp from Theorem 9. Specifically, define 


¥>(w) = 



It follows that = w. 

We now have all the ingredients we need to construct coding 
schemes for compute-and-forward. The next two sections de¬ 
velop coding schemes for parallel and successive computation, 
respectively. 


C. Linear Labeling 

We now show how this ensemble of nested lattice code¬ 
books can be connected to computing linear combinations 
of messages over Z p . Roughly speaking, we would like to 
map messages to nested lattice codewords so that our desired 
linear combinations can be directly recovered from appropriate 
integer-linear combinations of the lattice codewords. The origi¬ 
nal compute-and-forward framework [1] directly employed the 
Construction A mapping from a linear code to a lattice code. 
Subsequent work by Feng et al. [14] developed a richer set of 
algebraic connections as well as guidelines for selecting codes 
and constellations that are amenable to compute-and-forward. 
In the process, [ 14] proposed the concept of a linear labeling 
as an elegant way to map between an algebraic message space 
and a nested lattice code. We adopt this approach for our 
expanded framework. 


V. Parallel Computation Achievability: Proof of 
Theorem 1 

We now provide a detailed description of the encoding and 
decoding strategies used to achieve the parallel computation 
rate region from Theorem 1. Notice that, for a given integer 
coefficient matrix A, we determine the computation rate region 
T^comp'* (H, A) by taking the union over all integer matrices 
A whose rowspan contains that of A. This has a clear 
operational meaning within our scheme: the receiver recovers 
linear combinations with integer coefficient matrix A and then 
solves these for the desired linear combinations with integer 
coefficient matrix A. As a first step, we will show how to 
directly recover linear combinations with integer coefficient 
matrix A (for notational simplicity,) and derive conditions 
under which the probability of error can be driven to zero 
in Lemma 10. This will serve as a building block in the proof 
of Theorem 1 . 
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Venn Diagram of Nested Lattices 



kp 


kp 


kc,£ 


(a) 



Fig. 7. (a) Depiction of the linear labeling ip of Theorem 9 via signal levels for a user i with fc F = 13, krj = 11, and kc,l = 3. We use black circles to 

represent zeros and lightly shaded (blue) circles to represent occupied symbols. On the left-hand side, we see that the linear labeling of a lattice point from 
Ap £ must have zeros in its last fcp — fcp ( entries. Similarly, on the right-hand side, we see that the linear labeling of a lattice point from Aq l must have zeros 
in its last k\- — kc j entries, (b) Depiction of the linear labeling tp of Theorem 9 of an element of Ap^/Ac £. The * elements represent “don’t care” entries; 
for a given choice of signal levels represented by the blue circles, different realizations of the “don’t care” levels correspond to different representatives of 
the same coset. 


We begin with a high-level description of the encoding steps 
used to map the finite field messages onto dithered lattice 
codewords as well as the decoding steps used to estimate 
integer-linear combinations of lattice codewords, which are 
then mapped back to linear combinations of the messages. 
Take any choice of rates ,..., II j and parameter e > 0, and 
apply Theorem 8 to select good nested lattices. Afterwards, 
apply Theorem 9 to obtain a linear labeling tp and its inverse 
tp. The encoding and decoding process will make use of 
random Ks dither vectors that are generated independently and 
uniformly over the Voronoi regions of the coarse lattices, 

d e ~ Unif(Vc^). 

The ith encoder begins by adding kc,e ~ h: leading zeros 
and fcp — kp : £ trailing zeros to its message £ Zp M kc,e . 
The resulting length-fc vector is mapped onto a lattice point 
At £ Ci using the inverse p of the linear labeling followed by 
taking modulo A Cj r. Afterwards, the encoder dithers At using 
<T to obtain its channel input x/. These encoding operations 
are illustrated in Figure 8 and formally written out in (22). 
Note that encoding does not depend on either the channel 
matrix H or the choice of integer coefficient vector a m . 



Ah Encoder 


Fig. 8. Block diagram for the ^th encoder for both parallel computation and 
successive computation. 


Encoding: 


At = 




Ofcc,« —fee 


kp—kp g 


mod Ac,^ 


x e = [A e + dr] mod A c ,* 


(22a) 

(22b) 


18 The use of random dither vectors should not be viewed as common 
randomness, but rather as part of the random coding proof. In Appendix H, 
we will show that it suffices to use fixed dither vectors. 


To recover the linear combination u m with integer coef¬ 
ficient vector a m , the receiver first applies the equalization 
vector b m £ R' :Vr to its channel observation Y, to obtain the 
effective channel output y m . The receiver then attempts to 
recover the integer-linear combination 


where 


l l rn ~ 


^ ' 0>m,tAi 


L ^= 1 


mod Ac 


At — At — Q a c t (Af + df) 


(23) 

(24) 


is a lattice point in the same coset Ap t i/Ac,i as the Ah user’s 
lattice codeword At. An estimate of n m is obtained by 
first subtracting an integer-linear combination of the dithers 
from y m , then quantizing onto the fine lattice A F ,e( m ) where 

9{m) = argmax{fc F ^ : i £ {1 ,... ,L} s.t. [a m ^] mod p ^ 0} 


denotes the index of the finest lattice amongst those that 
participate in the integer-linear combination n rn , and finally 
taking mod Ac- To make an estimate u m of the desired linear 
combination u m , the receiver simply applies the linear labeling 
ip to /T m . These decoding operations are illustrated in Figure 9 
and formally written out in (25). 



Fig. 9. Block diagram for the rath decoder for parallel computation. 


Decoding: 

y T = b T Y 

J m m 


Am = 


<9 A, 




U m — dyP'm) 


L 



i -1 



mod Ac 


(25a) 

(25b) 


(25c) 
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The following lemma describes rates that are attainable for 
the coding strategy above. 

Lemma 10: Consider any choice of rates Ri ,..., Rl and 
parameter £ > 0 and assume that the transmitters and receiver 
employ the encoding and decoding operations from (22)-(25). 
For n and prime p large enough, there exists a generator 
matrix G £ Z k f xn and corresponding nested lattice codes 
Ci,,C l with rates at least Ri — £,..., Rl — (, respectively, 
such that, for any choice of channel matrix H £ R NrXL and 
integer matrix A £ Z L x L , the receiver can recover the linear 
combinations ui,..., u/, with probability of error at most £ 
so long as 


Re < 2 lo § + 


cr para(H, 3m, b m ) 


( 26 ) 


for all (m, £) such that \a m ,e\ mod p ^ 0 for some choice of 
equalization vectors bi,..., b/, £ □ 

Proof: First, we will select an ensemble of good nested 
lattice codebooks. Set a^s e = Pi2~ 2Re for £ = 

Invoke Theorem 8 with these cr'f ( 1 ,..., of f L and parameter 
e = f/L to obtain a generator matrix G £ lfjf xn and 
associated nested lattice codebooks Ci,. .., Cl with rates at 
least Ri — e ,..., Rl — e, respectively. Use Theorem 9 to obtain 
a linear labeling p : A F —> and its inverse (p : —>• Af- 

For £ = 1 generate an independent random dither 

vector according to d/ ~ UiiiffV'c.t) and make it available 
to the £th transmitter and the receiver. Each transmitter em¬ 
ploys (22) to generate its lattice codeword A i and channel 
input xr. From the Crypto Lemma, we know that x/ : is 
independent of A i and uniform over Vcj. Therefore, it follows 
from Theorem 8(b) that each transmitter satisfies its power 
constraint (1). 

For m = 1..... A, the receiver selects an equalization 
vector h m £ R^ 1 and generates an effective channel output 
y m via (25a). After subtracting an integer combination of the 
dither vectors, we obtain an integer combination of lattice 
codewords plus effective noise: 


L 

y m 'y ) tt m,e 

1=1 
L 

= y ) Q'm,e(?t-£ dr) -F Zp ara rn 
1=1 
L 

— y \ Hm,ei^e T dr Q a^^ (Ar T dr) dr) -t- Zp araim 
r=i 

L 

= y \ Om,rAr T Zp ara _ m 

e=i 


where 


Z para,m ~ ( b m H ~ a m) X + b m Z • 

The linear label of Ar satisfies 

<A(Ar) = 0 ¥ , (QA c ^(Ar + dr)) 

where 0 denotes subtraction over Z p . By Definition 11(a), 
since Q a c t (Ar + dr) £ Ac /, the last fcp — kc,e components of 


the label p[Qa c { (Ar + dr)) are zero. Therefore, p(Af) agrees 
with p(A r) on its last fc F — kc } e components, i.e., it may not 
agree on the first k— (fc F — kc e) = /c F — kc — & F T ^c,r = kc e~ 
kc components. This implies that <p(A r) £ [wr] as defined 
in (6). By Definition 11(b), p( Yht a m,e Ar ) =©r g m ,r<F(Ar) 
where q m _r = [ a m t r] mod p so the linear label can be viewed 
as a linear combination u m = p( a m ,t Ar) with integer 
coefficient vector a m . This also implies, via Definition 11(a), 
that 'ff J ia m ^Ai £ A F ,#( m ), which will be useful in the next 
step. 

Applying (25b), the receiver makes an estimate fi ln of the 
integer-linear combination n m . By Theorem 8(c), we have that 

F (£ m ^ R m ) < e if 

^paraCH, a m , b m ) < < J (27) 

where a m , b m ) is defined in (9). Afterwards, the 

receiver applies the linear labeling as in (25c) to obtain 
its estimate u m . If fi w = p, m , then u m = u m since 
p{Pm) = ^(Sr^/Ar) © p(QA c (J2e a m,eAi)) and the 
second term is equal to Or since it is a linear labeling of 
an element of Ac (i.e., the last k^ — kc = k components of its 
label are zero). 

Note that the condition (27) is equivalent to insisting that 
Rl < 2log + (Pr/^paialH^am^bm)) for all m and £ such 
that [a m /\ mod p ^ 0. Applying the union bound, we get 
that U m {U rn ^ < Le = £. Finally, it is argued in 

Appendix H that it suffices to use fixed dither vectors. ■ 

We are now ready to prove Theorem 1. 

Proof of Theorem 1: Choose rates I f and select 
nested lattice codebooks via Lemma 10. Each transmitter 
employs the encoding strategy from Lemma 10 (which does 
not depend on the channel matrix H nor the integer matrix 
A). Lor a given channel matrix H, say that the receiver wishes 
to decode linear combinations with integer coefficient matrix 
A where (i?i,..., Rl) £ Pcomp^H, A). This implies that 
there exists some integer matrix A satisfying rowspan(A) C 
rowspan(A) and (Ri,..., Rl) £ 72. pa i- a (H, A). The receiver 
applies the decoding strategy from Lemma 10 with inte¬ 
ger matrix A and optimal equalization vectors b opFm cho¬ 
sen via Lemma 2. Recall from (10) that o-p ara (H, a m ) = 
<7p ara (H, a m , b optjm ) so (26) matches the rate constraints from 
ft para (H, A). ■ 

VI. Successive Computation Achievability: Proof 
of Theorem 2 

In this section, we show how to improve the decod¬ 
ing process using successive cancellation, culminating in a 
proof of Theorem 2. The successive computation rate region 
7^-comp^ (H, A) involves a union over all integer matrices A 
whose rowspan contains that of A as well as a union over 
all admissible mappings A) for each A. As before, the 
union over integer matrices means that the receiver can first 
recover linear combinations with integer coefficient matrix 
A, and then solve these for the desired linear combinations 
with integer coefficient matrix A. Each admissible mapping 
I £ Af(A) corresponds to a specific successive cancella¬ 
tion order for the codewords, as we will explain in detail 
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below. We will begin by showing how to directly recover 
linear combinations with coefficient matrix A and admissible 
mapping X (for notational simplicity), and, in Lemma 13, state 
conditions under which the probability of error can be made 
to vanish in the blocklength. Afterwards, we will use this 
lemma to prove Theorem 2. Note that the decoding order for 
linear combinations is fixed to be lexicographic for notational 
convenience; other orders can be reached by exchanging rows 
of A (which is taken care of by the union over integer matrices 
that include the rowspan of A). 

We start with a high-level overview of our encoding and 
decoding process. As in Section V, we fix rates R\,... ,Rl 
and parameter e > 0 and then invoke Theorems 8 and 9 to 
select good nested lattices as well as a linear labeling ip and its 
inverse (p. Next, generate random dither vectors independently 
and uniformly over the Voronoi regions of the coarse lattices, 

d £ ~ Unif(Vc^). 

The encoding process is identical to that in the parallel 
computation case. We summarize the operations in (28) and 
refer the reader to Section V for a detailed discussion and 
Figure 8 for a block diagram. 


dithers dj,.... d/ , and the integer-linear combination //, = 
[ 12e a e^-e] mod Ac where ag is the fth entry of a and Xf = 
X e — Qa c , (Xf + df). Then, by calculating 


X = 


L 

M + df d^ 
1=1 


mod Ac 


s = <2Ac (y - x) + X , 


the receiver can obtain the integer-linear combination of the 
channel inputs, s T = a T X. 

Proof: By the distributive law. 


X = 


nr (a^+ df) 


f=i 

L 


mod Ac 


'y ' ae ^Af + df — Qa C i ( (Xf; + df 


i=i 

L 


mod Ac 


y (ifXf 
l=i 


mod Ac . 


(30) 


Therefore, 


Encoding: 


r ( 

0 k c ,e—kc 

\ 


w e 


[ \ 

0/cp— 

J\ 

Xf + 

d f] mod 7 

^c,e 


mod Ac,f 


(28a) 

(28b) 


The first decoding step (29a)-(29c) is quite similar to 
that of parallel computation (since there is not yet any side 
information to exploit). The decoder first recovers an integer- 
linear combination of lattice codewords //, as defined in (23) 
and then applies the linear labeling. Let 

d(ni) = argmax : t £ {1,..., L} s.t. (m, i) £ 1} 

denote the index of finest lattice that will participate in the 
mth integer-linear combination under the admissible mapping 
X. Note that A(l) = 9(1) since no successive cancellation 
takes place in the first round. 


Decoding, m = 1: 

y[ = b{Y 

At = Qa f ,* (1) (yi - X] a M d< 


mod Ac 


At = <f(Ai) 


Xi = 


Ai +^di,f df 


1=1 


mod Ac 


§i = QA c (yi - Xi) +Xi 


(29a) 

(29b) 

(29c) 

(29d) 

(29e) 


After making its estimate ui, the decoder attempts to re¬ 
construct the integer-linear combination of channel inputs 
a|X from Ai an d yi in (29d)-(29e). The following lemma 
characterizes when this process succeeds. 

Lemma 11: Assume a receiver has access to an observation 
of the form y T = a T X + zj s where a G Z L and z e g £ Vc, 


s T = <2Ac (y T - X T ) + x T 

= Q Ac ( aTx + z Jff - [ aTx ] mod A c ) + X T 

= Qac (Qa c (a T X) + zj s ^ + x T 

= <2a c (a T X) + [a T X] mod A c 
= a T X 

where (i) uses the fact that z e ff £ Vc as well as (30). ■ 

Thus, if Ai = Mil Lemma 11 will allow us to argue that 
sj = a|X, which can be used for successive computation 
as proposed by [16], i.e., creating better effective channels 
for subsequent linear combinations. In general, at the mth 
decoding step, we will have access to A m _iX where A m _i 
is the submatrix consisting of the first m — 1 rows of A, 
assuming all previous decoding steps are correct. 

The second ingredient in our decoding process is algebraic 
successive cancellation as proposed by [9]. The main idea is 
that, at decoding step m, it is possible to use linear combi¬ 
nations from steps 1 through m — 1, to cancel out some of 
the codewords participating in the integer-linear combination 
pL m without changing the effective noise variance. This in 
turn reduces the noise tolerance constraints placed on the fine 
lattices associated with the codewords and increases the overall 
rate region. Before we proceed, we need the following lemma 
that connects the definition of an admissible mapping to the 
existence of a matrix over 7L V that can be used for algebraic 
successive cancellation. 

Lemma 12: Let X be an admissible mapping for A £ 
Z ixl . For prime p large enough, there exists a lower uni- 
triangular matrix L £ such that, the (m,f) th entry of 

A = [LA] mod p is equal to zero (i.e., a m g = 0) for all 
(m, t) f X. Furthermore, L has a lower triangular inverse 
L(‘ nv ) £ Zp XL satisfying [A] mod p = [I2 mv) A] modp. 

Proof: By Definition 5, since X is an admissible map¬ 
ping, there exists a real-valued, lower unitriangular matrix 
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L £ R ixi such that the (to, £)th entry of LA is equal to 
zero for all (to, €) X. It follows from [9, Appendix A] that, 
for p large enough, there exists a lower unitriangular matrix 
L £ Zp XL satisfying the same criterion for A = [LA] mod p. 
Finally, since L is lower unitriangular, it has a lower unitrian¬ 
gular inverse over Z p . ■ 

An immediate consequence of Lemma 12 is that we can use 
preceding linear combinations to eliminate lattice codewords 
according to the admissible mapping. 


m—1 

Pm d" 'y ' ^Tn,iP 
2=1 

r L 

1=1 


mod Ac 
mod Ac 


(31) 


where l m i is the (m,*)th entry of L chosen via Lemma 12 
and a m f is the (to, f) entry of A = [LA] mod p. Similarly, 
using the inverse Ld nv ) of L, we can return to the original 
integer-linear combinations. 


m—1 

V inv) 
2=1 


mod Ac = p r 


(32) 


where is the (to,*) th entry of L/ lnv ). 

Overall, for m > 2, the mth successive decoding step 
begins by assembling the estimates of the integer-linear 
combinations of channel inputs from the previous to — 1 
decoding steps into a matrix S m _i. Assuming prior steps 
are correct, we have that S m _i = A m _iX. It then applies 
equalization vectors b m £ R^' and c m £ R m_1 to its 
observation Y and side information S m _i, respectively, 
and adds the results together to form its effective channel 
observation y m . Next, it applies algebraic successive 
cancellation, removes the dithers, and quantizes onto the 
appropriate fine lattice. After quantizing, it reverses the 
algebraic successive cancellation to obtain an estimate 
of the integer-linear combination fi m from (23). Finally, 
the decoder uses the linear labeling to make its estimate 
of the desired linear combination and follows the steps 
from Lemma 11 to estimate the corresponding integer-linear 
combination of the channel inputs. These operations are 
depicted in Figure 10 and formally expressed in (33). 


Decoding, m > 2: 

Sm-l — [si ' * ’ S m _i] 

Ym = b m Y + c m S m _i 


Vm 



F,i9(m) 



m— 1 L 

i= 1 £=1 



Am 

U m 

Xm 


m—1 

- i \ A ft inv) ~ 
v ™ + 2 ^ l m,i"i 
i=1 

v(A to) 

r L 

Am 3“ y ' a m,( 
1=1 


mod Ac 


mod Ac 


(33a) 

(33b) 

mod Ac 
(33c) 

(33d) 

(33e) 

(33f) 


§m = Q\ c (Ym - Xm) + Xm 


(33g) 


The following lemma captures the rates achievable for 
directly recovering the linear combinations with coefficient 
matrix A via successive computation. 

Lemma 13: Consider any choice of rates f?i,..., Rl and 
parameter ( > 0 and assume that the transmitters and receiver 
employ the encoding and decoding operations from (28), (29), 
and (33). For n and prime p large enough, there exists a 
generator matrix G £ Z p px " and corresponding nested lattice 
codes Ci ,..., Cl with rates at least R\ Rl — C 

respectively, such that, for any choice of channel matrix 
H £ l s,xi , integer matrix A £ Z LxL , and admissible 
mapping I, the receiver can recover the linear combinations 
ui,..., Ui with probability of error at most £ so long as 


Rf: < ^ lo g + 


Pf 


:(H, a. 


■m 5 bm, C m | A m _i) 


V(m,£) £ X 
(34) 


for some choice of equalization vectors h m £ E :V ' and c m £ 
R m_1 . □ 

Proof: The codebook generation process is nearly iden¬ 
tical to that in the proof of Lemma 10, except that we set 
e = C,/(2L) (and keep the choice cr^ ff ^ = Pe2~ 2Re ). The 
encoding process is also identical, and so the power constraint 
is met as in the proof of Lemma 10. Finally, the decoding 
step (29) to recover Ui is identical as well and, as argued in 
the proof of Lemma 10, ui = Ui with probability of error 
at most e. We condition on the event that Ui = Ui for the 
remainder of the proof. 

To show that §i = a} X, we need to argue that the 
effective noise (b^H — a)], )X + b^ Z is contained in the 
Voronoi region of the coarsest lattice Vc so that we can invoke 
Lemma 11. If at least one rate is non-zero, then we have 
that tjp ara (H, a m , b m ) < P max . From Theorem 8(c), it follows 
that the effective noise leaves Vc with probability at most e. 
Therefore, the decoder can recover a[X with probability at 
most 2e. 

We now proceed to argue by induction. Assume that de¬ 
coding has been successful for steps 1 through m — 1 with 
total probability of error at most 2 (to — l)e. We would like 
to argue that step m is successful with probability of error at 
most 2 me. Define the successive effective noise as 

^ succ, m - (b^H - a T )X + ds m _! + b^Z 
= (b^H + c^A m _i - a^JX + b^Z 


where the last step uses the correct decoding assumption, 
Sm-l = [si * • • S m _i] = A m _iX. 

The following equations show that the argument inside the 
quantizer in (33d) can be written as i/ m plus noise: 

r m—1 L -i 




mod Ac 


2—1 


1=1 


(0 


m—1 L 

Ym + ^ ^ ^ ^ ^m,£ 

2=1 i=l 


mod Ac 


- L m—1 

Y a ™^ Xf - d ^) + 55 m,iPi Z succ m 
- 1=1 i=l 


mod Ac 
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Fig. 10. Block diagram for the rath decoder for successive computation. 


(»*) 


(Hi) 


m — 1 


P'rn T ^ ^ T ^succ,r 


mod Ac 


— \ym T z SUC c,m] mod Ac 


where (i) uses the assumption that prior decoding steps 
are correct, ( ii ) uses the distributive law, and (in) follows 
from (31). Combining this with the nested quantization prop¬ 
erty from (19), we find that 


Qa f (l / m + ' 


mod Ac . 


Following a similar labeling argument as in the proof of 
Lemma 10, it can be shown that v m £ A F/1 >( m ). Thus, by 
Theorem 8(c), we have that P (O m i/ m ) < e if 


a mj b m , C m |A m _i) < (35) 

where of ucc (H,a m ,b m ,c m |A m _i) is defined in (12). Notice 
that, if €> m = u m , then by (32), u m = u m by 

the linear labeling argument in the proof of Lemma 10. If at 
least one rate is non-zero, then of ucc (H, a m , b m , c m |Am-i) < 
P max and we can apply Theorem 8 to establish that the effective 
noise z succ m leaves Vfc with probability at most e. Assuming 
this is the case, we can invoke Lemma 11 to show that 
sJn = aJ,X. To complete the induction step, we apply the 
union bound to show that P(u^L 1 {u m ^ u m }) < 2 me. After 
L decoding steps, we have probability of error at most 2 Le = £ 
as desired. Note that the condition in (35) is equivalent to re¬ 
quiring that Rt < §log + (Pe/ of ucc (H,a m ,b m ,c m |A m _i)) 
for all (to, /:) £ X. Finally, it is shown in Appendix H that it 
suffices to use fixed dither vectors. ■ 

We are now ready to prove Theorem 2. 

Proof of Theorem 2: Choose rates R\ ,..., Rl as well as 
nested lattice codebooks via Lemma 13. The transmitters 
use the encoding strategy from Lemma 13 (which does not 
depend on the channel matrix H nor the integer matrix 
A). For a given channel matrix H, assume the receiver 
wants linear combinations with integer coefficient matrix A. 
This implies that there exists an integer matrix A satisfy¬ 
ing rowspan(A) C rowspan(A) and admissible mapping 
X £ Ad(A) such that (_R 1; ..., R L ) £ 7?. SUCC (H, A,I). 
Therefore, the receiver can use the decoding strategy from 
Lemma 13 with integer matrix A, admissible mapping X, 
and optimal equalization vectors b optjTn and c optjm chosen 


via Lemma 6. Recall from (14) that of ucc (H, a m |A m _i) = 
cr s 2 ucc (H,a m ,b op t im ,c opt , m |A m _i) so (34) matches the rate 
constraints from 7^. SUC c(H, A,X). ■ 

VII. Conclusions 

Although compute-and-forward was originally proposed as 
a relaying strategy [1], recent efforts have demonstrated that 
it can be useful in the context of interference alignment [9]- 
[11] and low-complexity MIMO transceivers [3]—[8]. The 
aim of this paper was to create a unified framework that 
captures techniques (e.g., unequal power allocation, successive 
decoding) that are useful in these settings. Follow-up efforts 
have employed this expanded framework to develop a notion 
of uplink-downlink duality for integer-forcing [17] as well 
as investigate compute-and-forward for discrete memoryless 
networks [18], 

As mentioned earlier, the results of this paper are directly 
applicable to complex-valued channels by working with a 
real-valued decomposition of the channel, which corresponds 
to approximating the complex channel gains with Gaussian 
integers. However, recent efforts have demonstrated the ad¬ 
vantages of working directly over the complex field, e.g., by 
building lattices from Eisenstein integers [27], An interesting 
direction for future work is to generalize the results of Huang 
et al. [27] to create an expanded compute-and-forward frame¬ 
work for Eisenstein integers. More generally, it is of interest 
to generalize the algebraic framework of Feng et al. [14] to 
permit unequal powers. 

Another issue for future study is the development of op¬ 
timization techniques for use within the expanded compute- 
and-forward framework. It is well-established (see, for in¬ 
stance, [3], [14]) that the LLL basis reduction algorithm [88] 
and its variants are an attractive low-complexity solution for 
the problem of identifying good integer coefficient vectors 
within the original compute-and-forward framework. These 
techniques are a natural fit for the parallel computation strategy 
in Theorem 1. For the successive computation strategy in 
Theorem 2, it has been shown [7, Theorem 3] that the 
problem of minimizing the effective noise variances can be 
linked to Korkin-Zolotarev basis reduction [15]. This can in 
turn be approximated by multiple applications of the LLL 
algorithm. More broadly, applications such as integer-forcing 
interference alignment [10] require simultaneous optimization 
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of beamforming, equalization, and integer coefficient vectors. 
New heuristics and approximation algorithms are needed to 
characterize the performance of this strategy versus existing 
alignment strategies. See [89] for a recent approach that builds 
on uplink-downlink duality. 
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Appendix A 
Proof of Lemma 2 
Rewriting (9), we have that 

^paral^, 3 -mj b m ) = b m b m + (b m H b m — a m) 

= b^(l + HPH T )b m - aj n PH T b m 
- b^HPa m + aJ n Pa m , (36) 

which has a positive definite Hessian with respect to b m . 
Therefore, the minimizing value b„ pl m is found by setting 
the derivative equal to zero: 

2bl pt , m (l + HPH t ) - 2a^ PH t = 0 

b: pt , m = a;PH T (l + HPH T )- 1 . 

Plugging this back into (36), we get that 

min cd ra (H,a m ,b m ) 

b m 6R»> F 

= aj n (P PH t (I + HPH T )HP)a m 

from which the result follows by applying (8) and (7). ■ 

Appendix B 
Proof of Lemma 3 

Assume, for the sake of the contradiction, that a 2 n f > 
A ma x(l + PH t H) for some m and that there is a rate tuple 
in 72. para (H, A) for which Re > 0. Starting from the rate 
expression in Theorem 1, Re > 0 implies that 

Pi > CparatH^m) 

= aT(p- 1 +H T H) _1 a m 
= aTp(l + PH T H)“ 1 a m 
>aTpa ro A min ((l + PH T H)“ 1 ) 

pT Pp 

a m A 

~ A max (I + PH T H) 

> p ea 2 m ,e 
~ A max (l + PH T H) 

=► A max (l + PH T H)>^ 

where in (*) A m i„^(l + PH t H) ^ refers to the minimum 

eigenvalue of (l + PH T H) 1 and the inequality is due to the 
Min-Max Theorem [90, Theorem 4.2.2] for symmetric matri¬ 
ces. Thus, a contradiction has been reached, which establishes 
the desired bound on ■ 


Appendix C 
Proof of Lemma 6 
From (12), it follows that 

^"succ (^-7 ) b m , C m | A m —1) 

= bm(l + HPH T )b m - 2b^ HP(a m - A T m _ lCm ) 
( a m — c m A«i-l)P( a ™ — A-m — l C rn) ■ (37) 

This expression has a positive definite Hessian with respect 
to the vector [bj, cj,]. Therefore, the optimizing vector can 
be found by setting the derivative equal to zero. We start by 
solving for b opti m in terms of c opt , m . 

Taking the derivative of (37) with respect to b m and setting 
it equal to zero, we obtain 

2(1 + HPH T )b m - 2HP(a m - A^_ lCm ) = 0 . 

It follows that 

bj pt ,m = ( a m -Cl Am-OPHTp + HPHT)" 1 . 

Plugging back into (37) and canceling terms, we get 

^”succ(^’ bopt,m5 A m —l) 

= (a], - c J m A m _i) (P - PH t (I + HPHy'HP) (a m - A^_ lCm ) 

= ( a m - cJ l A m _ 1 )F T F(a m - Aj,_jC m ) • (38) 

where the last step uses (8) and (7). Next, we take the 
derivative with respect to c m and set it equal to zero, 

2A m -iF T FAm_ 1 c m - 2Am-iF T Fa m = 0 


from which it follows that 

c o P t,m = a mF T FA^_i(Am-iF T FA m _i) _1 . 

Finally, we plug back into (38) and cancel terms to obtain 


^succC^-’ b 0 pt: ? m, ^opt,m| A m _i) 

= a T (I - FAT_ 1 (Am- 1 F T FAT_ 1 )- 1 A m _iF T )Fa m . 

Note that FA^ (A m _iF T FA^ _ p ) ^m-iF 1 " is the pro¬ 
jection matrix for the subspace spanned by FAj,_j and 
Nm-i = I-FA^_ 1 (A m _iF T FA^_ 1 )" 1 A m _iF T )F is 
the projection matrix for the corresponding nullspace. Since 
projection matrices are idempotent (i.e, Nj,_i = N m -i) and 
N m _i is symmetric, it follows that 


^succC^j a m . b opLm .. 


Copt,m|A m —l) — a m F -N"m—lFclm 

= a(( l F T N^_ 1 Nm-iFa„ 
= ||N m _iFa TO || . 


Appendix D 
Proof of Theorem 3 

The inclusion 7\Lcomp' ) (H, A) C 7?.comp^ (H, A) follows di¬ 
rectly from the fact that the union in the former computation 
rate region is taken over a subset of the matrices A used in the 
union in the latter. We now turn to argue that 7^comp' ) (H, A) D 
7£comp*(H, A). In particular, we will show that for any integer 
matrix A, there exists a primitive basis matrix A pl -j m with the 
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same rowspan such that 7^ SUCC (H, A,I) C 7£ SUCC (H, A pnm ,X) 
for any admissible mapping I with respect to A. Without loss 
of generality, we assume that A is of the form 


A = 


A m 

P(L-M)xL_ 


(39) 


where the submatrix A m has M < L rows and is full rank. 
Lemma 14: For any integer matrix A of the form (39), there 

exists a rank M primitive basis matrix A pnrn = 

such that Am = TA pr j mj M where T is an M x M lower 
triangular integer matrix with strictly positive diagonal entries. 
□ 

The proof follows directly from [77, Corollary 1.24], 

Let A p ri m be a primitive basis matrix chosen using the 
lemma above. By assumption, there exists a lower unitriangu- 
lar matrix L that shows that I is admissible for A. Specifically, 
for each index pair (i,j) (f T, we have that ijaj = 0 where lj 
is the ?th row of L. We would like to show that I is admissible 
for A n ,j m as well. Define 


Aprim, M 
°(L-M)xL 


— /* ' CjEiprimj fm,m3prim,m 

7=1 

m —1 / m—1 \ 

= U,jC 0 p^m,i ~ tm,j j^primj — tm,m& pr im, 7 i 

j =1 ^ i=j ' 

m—1 /m—1 \ m 

= ]^prim,j — tm,j& prim ,7 

j =1 ' i=j ' j =1 

m—1 i m 

= C opt ,m,i E bj^primj E ^m,j 

j=l 7=1 7=1 

m—1 

W V- ~T ~T 


T 

trim ,j 


(V V- . _ . 

— / y c opt,m,i a i a m 

i=l 

A X -T 


— C opt,m-^-™-l *4 


m 


where (?) uses the fact that A m = TA pr i m ,M and that T is 
lower triangular. ■ 


Appendix E 
Proof of Theorem 4 


£ _ ^ T 0 Mx(L-M) 

[0(L-M)xM I L 

and note that L is lower triangular and ljaprimj = 0 where lj 
is the ?th row of L and a pnm) is the yth column of A pr i m . 
Finally, we renormalize the rows to obtain a unitriangular 
matrix L pr i m whose ?th row is equal to l prim i = 

We still have that lfo m = 0 so 1 is admissible for 

A ■ 

-^pnm • 

To complete the proof, we need to argue that the effective 
noise variances can only decrease by using A pl -i m instead of 
A. Let tij denote the (i,j)th entry of T and N p ri m , m -i the 
nullspace projection matrix (13) for A p ri m . Starting from (14), 
we have that 


^succC^i a m|A m -l) 

= mm ||b m || 2 + || (b^H - c^A.^! - a T m )P 1 / 2 || 2 

_ ^t>^ l|bm|| + || (b m H C A pr im,m—1 Ln.rn Uprim,m)^ ^ || 

— + II (b m H — c m Aprim,m-l '“4m,ma pr i m , m )P ^ || 

— ^succ (-^) fm,m a prim,m |A m _i) 

— 11 -Nprim.m— lFt m,m a prim,m 11 

(**) _ 2 

— 11-^prim,m—lL*a p rim,m 11 

— ^succ (bb a p rim ,m | Aprim,m— 1) 

where (?) relies on the fact that cj pt m A m -i — a J n = 
c T Aprim,m-l - for some choice of c which is 

shown below and (??) uses the fact that f mjm > 1 from 
Lemma 14. 

Let the jth entry of c be £j = U,j c opt,m,i - Ln,j 

where c op t,m,i is the ?th entry of c optim . It follows that 


2 Aprim,m—1 b 


i a prir 


Our proof follows along the same lines as the proof for the 
equal power setting [9, Theorem 3]. The following definition 
and theorem specialize basic results from the geometry of 
numbers [91] to the Euclidean case. 

Definition 12 (Successive Minima): Let A be a full-rank 
lattice in R L . For m = 1,..., L, the mth successive minimum 
A m of A corresponds to the radius of the smallest Euclidean 
ball centered at the origin that captures m linearly independent 
lattice points, 

A m — inf jr > 0 : dim span (B (0, r) fl A) = mj . 

The following lemma is a special case of Minkowski’s 
Second Theorem [91, p. 156]. 

Lemma 15 ( [15, Theorem 1.5]): Let F € R ixL be a full- 
rank matrix and let A = FZ L be the resulting full-rank lattice. 
The product of the successive minima is upper bounded as 
follows: 

L 

[] X 2 m < L L |det(F)| 2 

771=1 

Now, let Achannei = FZ L be the full-rank lattice generated 
by F from (7). Notice that the lengths of a dominant solution 
a],..., a* L correspond exactly to the successive minima of this 
lattice. Am = ||Fa*, ||. Furthermore, recall that the determinant 
of a lattice is defined as the absolute value of the determinant 
of any basis for the lattice [60, Definition 2.1.2] so that 
det (A c hannel) = dot(F) | . 

We now lower bound the sum of the rates from the theorem 
statement. 
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= \ l0 § ( II Pi ) ~ \ l0 § ( II CT para( H , <(/))) 

= ^ log det(P) - ^ log f H ||Fa^|A 

> ^ logdet(P) - i log (L L |det(F)| 2 ) 

= \ log det(P) - \ log det(F T F) - l - log (L) 

= ilogdet(P) - ilogdet((p- 1 + H t H)' 1 ) - | log(L) 
= 1 log det(I + PH t H) - | log(L) 

( = \ log det(l + HPH t ) - f - log (L) 

where ( i ) is due to Minkowski’s Second Theorem from above 
and ( ii) is due to Sylvester’s Determinant Identity [92]: for 
any square matrices Mi and M 2 , 

det(I + MiM 2 ) = det(I + M 2 Mi) . (40) 


Appendix F 
Proof of Lemma 8 


The following lemma will be useful for the proof. 

Lemma 16: For t = 1...., L, let A/ G Ap be lattice 
codewords with corresponding linear codewords (j>(\e)- Then, 
for any integer combination fi = a i^i where ay G Z, 

the corresponding linear codeword </>(/r) satisfies 

L 

<MfO = 

£=1 

where qy = [ay] mod p. □ 

Proof: 


<MfO = <t> 



L 

7 ~ l P Y, 
£—1 


mod p 


= [ 7 _1 p a iAi] modp ® ••• ® [ 7 _1 paiA i ] rnodp 
= [ai] modp • [ 7 _1 pAi] modp® 

• ■ • © [cll\ mod p ■ Y)~ 1 p\l\ mod p 
= q\ 4>( Ai) ffi ■ • • ® Ql </>(A l) 


First, note that the vectors Fai,.... Fa/, form a basis of 
R l since F and A have rank L. Denote the Gram-Schmidt 
orthogonalization of this basis by g*,..., g£ where 

Sm = lFa m 


and N m _i refers to the projection matrix for the nullspace of 
FA^_j defined in (13). Define G* = [g] • • • g£] T . It can 
be shown [77, Theorem 3.4] that det(G*) = det(FA T ). 

Since the Gram-Schmidt vectors are orthogonal, we have 
that 

L L 

det(G*(G*) T ) = [] ||g^|| 2 = «7 2 ucc (H,a m !A m _ 1 ) . 

m= 1 m— 1 

It follows that 

y I log f^ 

V^cclH^yArM-t)/ 

= \ lo § ( n p t) - \ !°g ( n <4cc( H » a m |A m _i)") 

' 1=1 ' ' m=l ' 

= i log det(P) - i log det(G* (G*) T ) 

= ilogdet(P) - ilogdet(AF T FA T ) 

= ilogdet(P) - i log det ((P- 1 + H T H) _1 ) -logdet(A) 
= ^ log det(l + PH t H) 

( = ^ log det (I + HPH t ) 

where (i) uses the fact that A is unimodular so det(A) = 1 
and (ii) uses Sylvester’s Determinant Identity from (40). ■ 


We now establish that the proposed labeling ip satisfies 
Definition 11(a). Recall that A G Ayj if and only if the 
corresponding linear codeword </>(A) G Cf,g Let v G Zp F 
be the unique vector satisfying <j>(A) = G T v. If f( A) G Cy t y, 
then the last ky — ky p components of v must be equal to 0, 
meaning that the last ky — ky^ components of the label </?(A) 
are equal to 0. Similarly, A G Ac,f if and only if the last 
ky — kc,e components of the label p(X) are equal to 0. 

Now we turn to establish that <p satisfies Definition 11(b). 
Let p, = XAi For each A^ G Af, let vy G Zp F 

denote the unique vector that satisfies 4>(Xy) = G T v/ : . From 
Lemma 16, we have that 

L 

<Km) = 0 <U 

e=i 

L 

= 0 <U G T vr 

£=1 

L 

= G T qevg 

e=i 

where qt = [a^] mod p. Let t G Zp F be the unique vector 
satisfying <j>(/x) = G T t. Since G is full rank, it follows that 

L 

t = 0 ■ 

e=i 

Finally, since the labels p(n) and p(\t) consist of the last k 
elements of t and v^, respectively, we have that 

L 

<f(m) = 0%^( Ar) 

1=1 

as desired. ■ 
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Appendix H 
Fixed Dithers 


Assume that we have established Lemma 10 or 13 using 
random dither vectors. Specifically, each dither vector dp is 
independently generated according to a uniform distribution 
over V/ ; . We would like to establish that there exist fixed dither 
vectors that can achieve the same rate region. Select any power 
constraints, channel matrix, and integer matrix as well as any 
rate tuple in the achievable region. Let xr(w^, dp) denote the 
channel input sequence for message vector wp £ if' 1 kc,e 
and dither vector dp £ Vcj:.. For a given realization of df, the 
average power is 

Pavg.r(dr) = k J_ kct ^||x,(w,,d ,)|| 2 

^ W£ 

where the average is taken with respect to a uniform dis¬ 
tribution over possible message vectors. Let 0 < 7 < 1 
be a parameter to be specified later. Using the fact that 
E[P av g 7 '(d/>)] < Pp, it follows from Markov’s inequality that 

p(p.„,«(d«) < Yzrpi) 

Using the independence of the dithers, we thus have that 

I*(p {/>..«,<(<!,) <^^1) >7- (41) 

For a given realization of di,..., dj,, the average probabil¬ 
ity of error is 


Perror (dl, , , . , dr,) 

1 


pile kp'(—kc,e 


l(fU 7 ^ W for some t) 


where the average is taken with respect to a uniform dis¬ 
tribution over possible message vectors. Using the fact that 
E[pen-oi-(di)... , d^)] < e, we have from Markov’s inequality 
that 


Perror(dl, • ■ • , d^) <C 


> 1 - 


7 


Combining this with (41), we know that with probability 7 / 2 , 
each power is at most l fi/ L times larger than its original 
target Pp and the average error probability is at most 4^- 
Therefore, there exist fixed dither vectors that satisfy these 
relaxed constraints as well. By taking 7 to zero, we can get as 
close as needed to the original target powers Pp. Afterwards, 
we can make the average probability of error as small to be as 
desired by choosing e. Finally, since the rate expressions are 
continuous functions of the powers, we can operate as close 
as we would like to any rate tuple in the original rate region 
using fixed dithers. 
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